Smart sampling for block family scan

ABSTRACT

A system can include a memory device and a processing device to perform operations that include determining a calibration scan frequency based on an amount of elapsed time since a previous write operation performed on the memory device, determining, based on the calibration scan frequency, whether one or more scan criteria are satisfied, responsive to determining that the one or more scan criteria are satisfied, identifying one or more block families, and calibrating one or more bin pointers of each of the identified block families, wherein the calibrating comprises: for each of the identified block families, updating each of the one or more bin pointers of the identified block family based on a data state metric of at least one block of the identified block family.

REFERENCE TO RELATED APPLICATIONS

The present application is a continuation of application Ser. No.17/125,895 filed on Dec. 17, 2020, the disclosure of which isincorporated herein by reference in its entirety for all purposes.

TECHNICAL FIELD

Embodiments of the disclosure relate generally to memory sub-systems,and more specifically, relate to smart sampling for block family scans.

BACKGROUND

A memory sub-system can include one or more memory devices that storedata. The memory devices can be, for example, non-volatile memorydevices and volatile memory devices. In general, a host system canutilize a memory sub-system to store data at the memory devices and toretrieve data from the memory devices.

BRIEF DESCRIPTION OF THE DRAWINGS

The disclosure will be understood more fully from the detaileddescription given below and from the accompanying drawings of variousembodiments of the disclosure. The drawings, however, should not betaken to limit the disclosure to the specific embodiments, but are forexplanation and understanding only.

FIG. 1 illustrates an example computing system that includes a memorysub-system in accordance with some embodiments of the presentdisclosure.

FIG. 2 schematically illustrates the temporal voltage shift caused bythe slow charge loss exhibited by triple-level memory cells, inaccordance with some embodiments of the present disclosure.

FIG. 3 depicts an example voltage boundary table and an example voltageoffset table.

FIG. 4A depicts an example graph illustrating the dependency of thethreshold voltage offset on the time after program (i.e., the period oftime elapsed since the block has been programmed, in accordance withsome embodiments of the present disclosure.

FIG. 4B schematically illustrates a set of predefined voltage bins, inaccordance with embodiments of the present disclosure.

FIG. 5 schematically illustrates block family management operationsimplemented by a block family manager component, in accordance withembodiments of the present disclosure.

FIG. 6 schematically illustrates selecting block families forcalibration, in accordance with embodiments of the present disclosure.

FIG. 7 schematically illustrates example metadata maintained by thememory sub-system controller, in accordance with aspects of the presentdisclosure.

FIG. 8A schematically illustrates example scan metadata generated by amedia-management scan and maintained by the memory sub-systemcontroller, in accordance with aspects of the present disclosure.

FIG. 8B schematically illustrates example old blocks metadata generatedby a calibration scan and maintained by the memory sub-systemcontroller, in accordance with aspects of the present disclosure.

FIG. 9 is a flow diagram of an example method to perform a calibrationscan, in accordance with aspects of the present disclosure.

FIG. 10 is a flow diagram of an example method to perform amedia-management scan, in accordance with aspects of the presentdisclosure.

FIG. 11A schematically illustrates an example calibration scan performedby the memory sub-system controller based on a particular scanfrequency, in accordance with aspects of the present disclosure.

FIG. 11B schematically illustrates an example calibration scan,performed by the memory sub-system controller, in which the scanfrequency varies based on power state, in accordance with aspects of thepresent disclosure.

FIG. 11C schematically illustrates an example calibration scan,performed by the memory sub-system controller, in which the scanfrequency varies based on workload, in accordance with aspects of thepresent disclosure.

FIG. 11D schematically illustrates an example calibration scan,performed by the memory sub-system controller, in which the scanfrequency varies based on program/erase count, in accordance withaspects of the present disclosure.

FIG. 12A is a flow diagram of an example method to perform a calibrationscan in which the oldest block family within each bin is scanned, inaccordance with aspects of the present disclosure.

FIG. 12B is a flow diagram of an example method to perform a calibrationscan in which the block family prioritization and calibration scanfrequency can vary based on characteristics associated with the memorysub-system, in accordance with aspects of the present disclosure.

FIG. 12C is a flow diagram of an example method to determine blockfamily prioritization based on characteristics associated with thememory sub-system, in accordance with aspects of the present disclosure.

FIG. 12D is a flow diagram of an example method to determine calibrationscan frequency based on characteristics associated with the memorysub-system, in accordance with aspects of the present disclosure.

FIG. 13 schematically illustrates calibration scans, performed by thememory sub-system controller, in which the scan frequency varies basedon power state, in accordance with aspects of the present disclosure.

FIGS. 14A and 14B are flow diagrams of example methods to performcalibration scans in which the scan frequency varies based on powerstate, in accordance with aspects of the present disclosure.

FIG. 15 is a block diagram of an example computer system in whichembodiments of the present disclosure can operate.

DETAILED DESCRIPTION

Aspects of the present disclosure are directed to reliability scanassisted bin selection. A memory sub-system can be a storage device, amemory module, or a combination of a storage device and memory module.Examples of storage devices and memory modules are described below inconjunction with FIG. 1 . In general, a host system can utilize a memorysub-system that includes one or more components, such as memory devicesthat store data. The host system can provide data to be stored at thememory sub-system and can request data to be retrieved from the memorysub-system.

A memory sub-system can include high density non-volatile memory deviceswhere retention of data is desired when no power is supplied to thememory device. One example of non-volatile memory devices is anegative-and (NAND) memory device. Other examples of non-volatile memorydevices are described below in conjunction with FIG. 1 . A non-volatilememory device is a package of one or more dice. Each die can consist ofone or more planes. For some types of non-volatile memory devices (e.g.,NAND devices), each plane consists of a set of physical blocks. Eachblock consists of a set of pages. Each page consists of a set of memorycells (“cells”). A cell is an electronic circuit that storesinformation. Depending on the cell type, a cell can store one or morebits of binary information, and has various logic states that correlateto the number of bits being stored. The logic states can be representedby binary values, such as “0” and “1”, or combinations of such values.

Data operations can be performed by the memory sub-system. The dataoperations can be host-initiated operations. For example, the hostsystem can initiate a data operation (e.g., write, read, erase, etc.) ona memory sub-system. The host system can send access requests (e.g.,write command, read command) to the memory sub-system, such as to storedata on a memory device at the memory sub-system and to read data fromthe memory device on the memory sub-system. The data to be read orwritten, as specified by a host request, is hereinafter referred to as“host data.” A host request can include logical address information(e.g., logical block address (LBA), namespace) for the host data, whichis the location the host system associates with the host data. Thelogical address information (e.g., LBA, namespace) can be part ofmetadata for the host data. Metadata can also include error handlingdata (e.g., ECC codeword, parity code), data version (e.g. used todistinguish age of data written), valid bitmap (which LBAs or logicaltransfer units contain valid data), etc.

A memory device includes multiple memory cells, each of which can store,depending on the memory cell type, one or more bits of information. Amemory cell can be programmed (written to) by applying a certain voltageto the memory cell, which results in an electric charge being held bythe memory cell, thus allowing modulation of the voltage distributionsproduced by the memory cell. Moreover, precisely controlling the amountof the electric charge stored by the memory cell allows multiplethreshold voltage levels to be used, corresponding to different logicallevels. Multiple threshold levels allow a single memory cell to storemultiple bits of information: a memory cell operated with 2^(n)different threshold voltage levels is capable of storing n bits ofinformation. “Threshold voltage” herein shall refer to the voltage levelthat defines a boundary between two neighboring voltage distributionscorresponding to two logical levels. Thus, a read operation can beperformed by comparing the measured voltage exhibited by the memory cellto one or more threshold voltage levels in order to distinguish betweentwo logical levels for single-level cells and between multiple logicallevels for multi-level cells.

Because of the phenomenon known as slow charge loss (“SCL”), thethreshold voltage of a memory cell changes in time as the electriccharge of the cell is degrading, which is referred to as “temporalvoltage shift” (since the degrading electric charge causes the voltagedistributions to shift along the voltage axis towards lower voltagelevels). Temporal voltage shift (TVS) herein shall refer to a change inthe measured voltage of cells as a function of time. Temporal VoltageShift can include different components such as intrinsic charge loss,system charge loss, quick charge loss, etc. Memory formed from certainNAND technologies generally exhibits more TVS than floating gate NAND.TVS is generally increased by Program Erase Cycles (PEC), highertemperatures, and higher program voltages. TVS shows significantdie-to-die variation. In memory that exhibits TVS, the threshold voltageis changing rapidly at first (immediately after the memory cell wasprogrammed), and then slows down in an approximately logarithmic linearfashion with respect to the time elapsed since the cell programmingevent. If not mitigated, the temporal voltage shift caused by the slowcharge loss can result in the increased bit error rate in readoperations.

Temporal voltage shift can be mitigated by implementing a memorysub-system that employs block family based error avoidance strategies,thus significantly improving the bit error rate exhibited by the memorysub-system. In accordance with embodiments of the present disclosure,the temporal voltage shift is selectively tracked for a programmed setof memory cells grouped by block families. Appropriate voltage “readlevel offsets,” which are based on block affiliation with a certainblock family, are applied to the base read levels to perform readoperations. “Block” herein shall refer to a set of contiguous ornon-contiguous memory pages. An example of “block” is “erasable block,”which is the minimal erasable unit of memory, while “page” is a minimalwritable unit of memory. Each page includes of a set of memory cells. Amemory cell is an electronic circuit that stores information. “Blockfamily” herein shall refer to a possibly noncontiguous set of memorycells (which can reside in one or more full and/or partial blocks, thelatter referred to as “partitions” herein) that have been programmedwithin a specified time window and a specified temperature window, andthus are expected to exhibit similar or correlated changes in theirrespective data state metrics. A block family can be made with anygranularity, containing only whole codewords, whole pages, whole superpages, or whole superblocks, or any combination of these. In someimplementations, base read levels can be stored in the metadata of thememory device. “Data state metric” herein shall refer to a quantity thatis measured or inferred from the state of data stored on a memorydevice. Specifically, the data state metrics can reflect the state ofthe temporal voltage shift, the degree of read disturb, and/or othermeasurable functions of the data state as will be discussed in moredetail. A composite data state metric is a function (e.g., a weightedsum) of a set of component state metrics.

“Read level” herein shall refer to a voltage position. Read levels arenumbered in increasing voltage from L1 through 2^(n), wherein n is thenumber of bits that can be stored in the cell. “Read level value” hereinshall refer to a voltage or DAC value representing a voltage that thatis applied to the read element (often, the control gate for a NAND cell)for purposes of reading that cell. “Read level offset” herein shallrefer to a component of the equation that determines the read levelvalue. “Calibration” herein shall refer to altering a read level value(possibly by adjusting a read level offset or read level base) to bettermatch the ideal read levels for a read or set of reads.

“Bin” (or “voltage bin” or “voltage offset bin”) herein shall refer to aset of read level offsets that are applied to a set of data. The binoffsets are read level offsets that affect the read level for blockfamilies within the bin. An old or older bin is one where the read leveloffsets are directed at data that was written at a relatively earlytime. A young or younger bin is one where the read level offsets aredirected at data written relatively recently. “Bin selection” hereinshall refer to the process by which the memory device selects which binto use for a given read.

The memory sub-system controller can periodically perform a calibrationprocess (also referred to as a calibration scan) to update theassociations between block families and bins. Each block family isassociated with a set of dies on which blocks of the block family arestored. The association of a block family and dies with voltage bins canbe represented by a set of bin pointers that includes a bin pointer foreach die of the block family. For a particular block family, eachparticular die is associated with a bin pointer that identifies (“pointsto”) a voltage bin, thereby establishing an association between theblock family and the voltage bin for the particular die. Bins can beidentified by bin numbers (e.g., numbers between 0 and 7 in an 8 voltagebin architecture). Each bin pointer can thus be a bin number. Theassociations of blocks with block families and block families and dieswith voltage bins can be stored in respective metadata tables maintainedby the memory sub-system controller (e.g., as bin numbers in themetadata tables).

The calibration scan can evaluate a data state metric (e.g., a voltageshift or bit error rate) for each die of each block family with one of aset of predefined voltage bins, e.g., by, for each die of each blockfamily, measuring a value of data state metric of a block (of the blockfamily) stored on the die. The calibration scan can then update a binpointer associated with the die and block family to point to a voltagebin that corresponds to the measured value of the data state metric.Each voltage bin is in turn associated with a voltage offset to beapplied for read operations. For example, the bin pointer can remain thesame if the data state metric is in a range associated with the existingbin pointer, or can be changed to point to an older bin if the datastate metric is in a range associated with the older bin. Although ablock family can be associated (by bin pointers) with multiple differentbins, a block family is herein referred to as being associated with(“in”) a particular one of the bins. More particularly, a block familyis associated with (or in) the oldest bin with which a die of the blockfamily is associated.

Generally, the temporal voltage shift for younger block families (i.e.,block families that are more recently created) is more significant thanthe temporal voltage shift for older block families (i.e., blockfamilies that are less recently created). The memory sub-systemcontroller can periodically perform the calibration scan for each blockfamily based on the age of the block family, which corresponds to thevoltage bin associated with the block family. For example, in an 8voltage bin architecture, newly created block families can be associatedwith voltage bin 0, while the oldest (i.e., least recently created)block families are associated with voltage bin 7. The memory sub-systemcontroller performs the calibration scan for the block families involtage bin 0 more frequently than for the block families in voltage bin7, based on the age of the block families associated with voltage bin 0(e.g., based on the logarithmic linear nature of SCL).

Since the calibration scan involves obtaining measurements of a currentstate of data at each scanned block family and performing calculationsbased on the measured state data, the calibration scan can utilize asignificant amount of memory sub-system resources. Accordingly, thecalibration scan can increase latency and power consumption of thememory sub-system. Thus, the calibration scan should be performedinfrequently to minimize the reduction in system performance. However,the calibration scan is time-sensitive, because miscalibration can occurwhen bin pointers associated with a block family are not updated in atimely manner to compensate for the temporal voltage shift of datastored in blocks associated with the block family. Such miscalibrationcan result in read errors that can adversely affect the memorysub-system's performance. Thus, the calibration scan should be performedfrequently to avoid miscalibration and read errors. Determining theappropriate calibration scan frequency can be difficult, becausefrequent calibration scans can avoid miscalibration and read errors, butcan also reduce system performance by consuming system resources such asprocessor time.

Aspects of the present disclosure address the above and otherdeficiencies by adjusting the calibration scan to adapt to changingcharacteristics of the memory sub-system. These characteristics of thememory sub-system are referred to herein as calibration demandcharacteristics, and can include system state characteristics, such asthe power state of the memory sub-system or host, as well ascharacteristics of data being stored, such as workload characteristics.The calibration demand characteristics can include block family age,power state of the memory sub-system or host (e.g., active, idle,hibernating, or sleeping), workload, program/erase cycle counts of thememory device, or read error rates. The calibration scan can be adjustedto adapt to changing calibration demand characteristics by, for example,changing the calibration scan frequency in accordance with thecalibration demand characteristics. The calibration scan adjustments canbe performed in response to detecting that threshold calibration demandcriteria are satisfied by the calibration demand characteristics. Forexample, a set of calibration demand ranges, such as ranges of readerror rates, can be associated with a set of corresponding calibrationscan adjustment parameters that specify how to adjust the calibrationscan for each of the ranges. Alternatively, other suitable functionsthat map calibration demand to calibration scan adjustment parameterscan be used, such as linear or quadratic functions. Thus, thecalibration scan can be adjusted based on the characteristics associatedwith the memory sub-system by modifying scan parameters such as the scanfrequency or other criteria that control the initiation of calibrationscans.

As an example, the calibration scan can be adjusted based on blockfamily age by selecting a subset of the block families, then scanningthe subset of block families, but not other block families. The subsetof block families to be scanned can include, for example, the oldestblock family in each voltage bin. Alternatively or additionally, asubset of the block families can be prioritized so that the blockfamilies in the subset are scanned prior to other block families. Thesubset of block families to be prioritized can be determined based onread error rates associated with the block families, so that blockfamilies associated with read error rates greater than a threshold errorrate are scanned prior to other block families having error rates belowthe threshold. Alternatively, a priority can be associated with eachblock family based on the block family's error rate, and the blockfamilies can be scanned in order according to their priorities.

Calibration scans can be performed based on the power state of thememory sub-system or host system. In an idle state, the memorysub-system can devote a substantial amount of processing time toperforming scan operations because few if any host reads or writes areexpected to occur. Thus, in response to a transition from an activestate 1114 to an idle state 1116, the block family manager component 113can increase the scan frequency. A calibration scan can be performed inresponse to determining that at least one threshold criterion issatisfied. The threshold criterion can be based on a power state of thememory sub-system or host system. For example, the threshold criterioncan be satisfied when the memory sub-system is in an idle power state,or when the memory sub-system transitions to an active power state froma sleep state or low power state. Further, the calibration scanfrequency can be adjusted in proportion to a workload intensity, such asthe number of write operations performed by the memory sub-system over aperiod of time, or in proportion to a number of program/erase cyclesassociated with a memory device.

Advantages of the present disclosure include, but are not limited to,decreasing a number operations, and thus the amount of time, used inblock family calibration scans. For example, data state metricmeasurement operations performed by calibration scans consume asubstantial amount of time. Using the adaptive scan frequency techniquesdisclosed herein, the memory sub-system is able to perform calibrationat high frequencies (with low latencies) during times of higher demandfor calibration, so that bin pointers can more accurately reflect thefaster rate of charge loss for younger bins. Calibration can beperformed at low frequencies during times in which low frequencies aresufficient to maintain read level calibration. Thus, the memorysub-system can reduce the amount of processing time used by thecalibration scan during time periods in which high-frequency scans arenot needed, while retaining the benefits of high-frequency scans duringperiods of greater demand, which correspond to periods of greater timesensitivity.

Since the calibration scan frequency can be increased during periods ofgreater calibration demand, and reduced during periods of lowercalibration demand, the assignment of voltage bins to block families canbe maintained with greater accuracy while using fewer computationalresources during the periods of lower calibration demand. For example, aunit of programmed data in a block family that is near transition toanother voltage bin can be older than the age threshold for itscurrently-associated voltage bin, thereby causing reads to use aless-accurate read level offset for that unit of programmed data. Theunit of programmed data is eventually re-assigned to an appropriateolder bin by the calibration scan. The delay until this re-assignmentcan be reduced by prioritizing the oldest block family within each bin,and/or increasing the rate at which bins are scanned, e.g., byincreasing the scan frequency when the workload is high, usingtechniques disclosed herein. As another example, if the unit ofprogrammed data is older than the age threshold for its associatedvoltage bin, attempts to read the unit of data or associated units ofdata (e.g., other units in the same page or block) can cause anincreased read error rate for the unit's block. Re-assignment of theunit to an appropriate bin can be expedited by prioritizing blockshaving high error rates in the calibration scan, using techniquesdisclosed herein.

Further, since the calibration scan can identify particular blockfamilies to scan, such as the oldest block family in teach bin, or blockfamilies associated with blocks having high error rates, the number ofscanned blocks can be reduced. Thus, a significant amount of memorysub-system resources can be made available for other processes, eitherin combination with or as an alternative to the increased calibrationscan frequency benefits described above. This availability of memorysystem resources for uses other than the calibration scan results in adecrease in overall memory sub-system latency and an increase in overallmemory sub-system efficiency.

FIG. 1 illustrates an example computing system 100 that includes amemory sub-system 110 in accordance with some embodiments of the presentdisclosure. The memory sub-system 110 can include media, such as one ormore volatile memory devices (e.g., memory device 140), one or morenon-volatile memory devices (e.g., memory device 130), or a combinationof such.

A memory sub-system 110 can be a storage device, a memory module, or ahybrid of a storage device and memory module. Examples of a storagedevice include a solid-state drive (SSD), a flash drive, a universalserial bus (USB) flash drive, an embedded Multi-Media Controller (eMMC)drive, a Universal Flash Storage (UFS) drive, a secure digital (SD)card, and a hard disk drive (HDD). Examples of memory modules include adual in-line memory module (DIMM), a small outline DIMM (SO-DIMM), andvarious types of non-volatile dual in-line memory module (NVDIMM).

The computing system 100 can be a computing device such as a desktopcomputer, laptop computer, network server, mobile device, a vehicle(e.g., airplane, drone, train, automobile, or other conveyance),Internet of Things (IoT) enabled device, embedded computer (e.g., oneincluded in a vehicle, industrial equipment, or a networked commercialdevice), or such computing device that includes memory and a processingdevice.

The computing system 100 can include a host system 120 that is coupledto one or more memory sub-systems 110. In some embodiments, the hostsystem 120 is coupled to different types of memory sub-system 110. FIG.1 illustrates one example of a host system 120 coupled to one memorysub-system 110. As used herein, “coupled to” or “coupled with” generallyrefers to a connection between components, which can be an indirectcommunicative connection or direct communicative connection (e.g.,without intervening components), whether wired or wireless, includingconnections such as electrical, optical, magnetic, etc.

The host system 120 can include a processor chipset and a software stackexecuted by the processor chipset. The processor chipset can include oneor more cores, one or more caches, a memory controller (e.g., NVDIMMcontroller), and a storage protocol controller (e.g., PCIe controller,SATA controller). The host system 120 uses the memory sub-system 110,for example, to write data to the memory sub-system 110 and read datafrom the memory sub-system 110.

The host system 120 can be coupled to the memory sub-system 110 via aphysical host interface. Examples of a physical host interface include,but are not limited to, a serial advanced technology attachment (SATA)interface, a peripheral component interconnect express (PCIe) interface,universal serial bus (USB) interface, Fibre Channel, Serial AttachedSCSI (SAS), a double data rate (DDR) memory bus, Small Computer SystemInterface (SCSI), a dual in-line memory module (DIMM) interface (e.g.,DIMM socket interface that supports Double Data Rate (DDR)), etc. Thephysical host interface can be used to transmit data between the hostsystem 120 and the memory sub-system 110. The host system 120 canfurther utilize an NVM Express (NVMe) interface to access components(e.g., memory devices 130) when the memory sub-system 110 is coupledwith the host system 120 by the PCIe interface. The physical hostinterface can provide an interface for passing control, address, data,and other signals between the memory sub-system 110 and the host system120. FIG. 1 illustrates a memory sub-system 110 as an example. Ingeneral, the host system 120 can access multiple memory sub-systems viaa same communication connection, multiple separate communicationconnections, and/or a combination of communication connections.

The memory devices 130, 140 can include any combination of the differenttypes of non-volatile memory devices and/or volatile memory devices. Thevolatile memory devices (e.g., memory device 140) can be, but are notlimited to, random access memory (RAM), such as dynamic random accessmemory (DRAM) and synchronous dynamic random access memory (SDRAM).

Some examples of non-volatile memory devices (e.g., memory device 130)include negative-and (NAND) type flash memory and write-in-place memory,such as three-dimensional cross-point (“3D cross-point”) memory. Across-point array of non-volatile memory can perform bit storage basedon a change of bulk resistance, in conjunction with a stackablecross-gridded data access array. Additionally, in contrast to manyflash-based memories, cross-point non-volatile memory can perform awrite in-place operation, where a non-volatile memory cell can beprogrammed without the non-volatile memory cell being previously erased.NAND type flash memory includes, for example, two-dimensional NAND (2DNAND) and three-dimensional NAND (3D NAND).

Each of the memory devices 130 can include one or more arrays of memorycells. One type of memory cell, for example, single level cells (SLC)can store one bit per cell. Other types of memory cells, such asmulti-level cells (MLCs), triple level cells (TLCs), and quad-levelcells (QLCs), can store multiple bits per cell. In some embodiments,each of the memory devices 130 can include one or more arrays of memorycells such as SLCs, MLCs, TLCs, QLCs, or any combination of such. Insome embodiments, a particular memory device can include an SLC portion,and an MLC portion, a TLC portion, or a QLC portion of memory cells. Thememory cells of the memory devices 130 can be grouped as pages that canrefer to a logical unit of the memory device used to store data. Withsome types of memory (e.g., NAND), pages can be grouped to form blocks.

Although non-volatile memory devices such as 3D cross-point array ofnon-volatile memory cells and NAND type flash memory (e.g., 2D NAND, 3DNAND) are described, the memory device 130 can be based on any othertype of non-volatile memory, such as read-only memory (ROM), phasechange memory (PCM), self-selecting memory, other chalcogenide basedmemories, ferroelectric transistor random-access memory (FeTRAM),ferroelectric random access memory (FeRAM), magneto random access memory(MRAM), Spin Transfer Torque (STT)-MRAM, conductive bridging RAM(CBRAM), resistive random access memory (RRAM), oxide based RRAM(OxRAM), negative-or (NOR) flash memory, electrically erasableprogrammable read-only memory (EEPROM).

A memory sub-system controller 115 (or controller 115 for simplicity)can communicate with the memory devices 130 to perform operations suchas reading data, writing data, or erasing data at the memory devices 130and other such operations. The memory sub-system controller 115 caninclude hardware such as one or more integrated circuits and/or discretecomponents, a buffer memory, or a combination thereof. The hardware caninclude a digital circuitry with dedicated (i.e., hard-coded) logic toperform the operations described herein. The memory sub-systemcontroller 115 can be a microcontroller, special purpose logic circuitry(e.g., a field programmable gate array (FPGA), an application specificintegrated circuit (ASIC), etc.), or other suitable processor.

The memory sub-system controller 115 can include a processor 117 (e.g.,processing device) configured to execute instructions stored in localmemory 119. In the illustrated example, the local memory 119 of thememory sub-system controller 115 includes an embedded memory configuredto store instructions for performing various processes, operations,logic flows, and routines that control operation of the memorysub-system 110, including handling communications between the memorysub-system 110 and the host system 120.

In some embodiments, the local memory 119 can include memory registersstoring memory pointers, fetched data, etc. The local memory 119 canalso include read-only memory (ROM) for storing micro-code. While theexample memory sub-system 110 in FIG. 1 has been illustrated asincluding the memory sub-system controller 115, in another embodiment ofthe present disclosure, a memory sub-system 110 does not include amemory sub-system controller 115, and can instead rely upon externalcontrol (e.g., provided by an external host, or by a processor orcontroller separate from the memory sub-system).

In general, the memory sub-system controller 115 can receive commands oroperations from the host system 120 and can convert the commands oroperations into instructions or appropriate commands to achieve thedesired access to the memory devices 130. The memory sub-systemcontroller 115 can be responsible for other operations such as wearleveling operations, garbage collection operations, error detection anderror-correcting code (ECC) operations, encryption operations, cachingoperations, and address translations between a logical address (e.g.,logical block address (LBA), namespace) and a physical address (e.g.,physical block address) that are associated with the memory devices 130.The memory sub-system controller 115 can further include host interfacecircuitry to communicate with the host system 120 via the physical hostinterface. The host interface circuitry can convert the commandsreceived from the host system into command instructions to access thememory devices 130 as well as convert responses associated with thememory devices 130 into information for the host system 120.

In some implementations, memory sub-system 110 can use a stripingscheme, according to which every data payload (e.g., user data) utilizesmultiple dies of the memory devices 130, 140 (e.g., NAND type flashmemory devices), such that the payload is distributed through a subsetof dies, while the remaining one or more dies are used to store theerror correction information (e.g., party bits). Accordingly, a set ofblocks distributed across a set of dies of a memory device using astriping scheme is referred herein to as a “superblock.”

The memory sub-system 110 can also include additional circuitry orcomponents that are not illustrated. In some embodiments, the memorysub-system 110 can include a cache or buffer (e.g., DRAM) and addresscircuitry (e.g., a row decoder and a column decoder) that can receive anaddress from the memory sub-system controller 115 and decode the addressto access the memory devices 130.

In some embodiments, the memory devices 130 include local mediacontrollers 135 that operate in conjunction with memory sub-systemcontroller 115 to execute operations on one or more memory cells of thememory devices 130. An external controller (e.g., memory sub-systemcontroller 115) can externally manage the memory device 130 (e.g.,perform media management operations on the memory device 130). In someembodiments, a memory device 130 is a managed memory device, which is araw memory device combined with a local controller (e.g., localcontroller 135) for media management within the same memory devicepackage. An example of a managed memory device is a managed NAND (MNAND)device.

The memory sub-system 110 includes a block family manager component 113that can select voltage bins to be associated with block families at amemory device, e.g., by performing calibration scans. The block familymanager component 113 or another component of memory sub-systemcontroller 115 can perform media-management scans, which can performmedia management operations on the memory device 130. The block familymanager component 113 can also modify the calibration scans, e.g., bymodifying parameters of the scans, based on characteristics associatedwith the memory device. In some embodiments, the memory sub-systemcontroller 115 includes at least a portion of the block family managercomponent 113. For example, the memory sub-system controller 115 caninclude a processor 117 (processing device) configured to executeinstructions stored in local memory 119 for performing the operationsdescribed herein. In some embodiments, the block family managercomponent 113 is part of the host system 110, an application, or anoperating system. Further details regarding block families and blockfamily manager component 113 are described below.

FIG. 2 illustrates the temporal voltage shift caused at least in part bythe slow charge loss exhibited by triple-level memory cells, inaccordance with embodiments of the disclosure. While the illustrativeexample of FIG. 2 utilizes triple-level cells, the same observations canbe made and, accordingly, the same remedial measures are applicable tosingle level cells and any memory cells having multiple levels.

A memory cell can be programmed (written to) by applying a certainvoltage (e.g. program voltage) to the memory cell, which results in anelectric charge stored by the memory cell. Precisely controlling theamount of the electric charge stored by the memory cell allows a memorycell to have multiple threshold voltage levels that correspond todifferent logical levels, thus effectively allowing a single memory cellto store multiple bits of information. A memory cell operated with 2^(n)different threshold voltage levels is capable of storing n bits ofinformation.

Each of chart 210 and 230 illustrate program voltage distributions220A-420N (also referred to as “program distributions” or “voltagedistributions” or “distributions” herein) of memory cells programmed bya respective write level (which can be assumed to be at the midpoint ofthe program distribution) to encode a corresponding logical level. Theprogram distributions 220A through 220N can illustrate the range ofthreshold voltages (e.g., normal distribution of threshold voltages) formemory cells programmed at respective write levels (e.g., programvoltages). In order to distinguish between adjacent programdistributions (corresponding to two different logical levels), the readthreshold voltage levels (shown by dashed vertical lines) are defined,such that any measured voltage that falls below a read threshold levelis associated with one program distribution of the pair of adjacentprogram distributions, while any measured voltage that is greater thanor equal to the read threshold level is associated with another programdistribution of the pair of neighboring distributions.

In chart 210, eight states of the memory cell are shown belowcorresponding program distributions (except for the state labeled ER,which is an erased state, for which a distribution is not shown). Eachstate corresponds to a logical level. The threshold voltage levels arelabeled Va-Vh. As shown, any measured voltage below Va is associatedwith the ER state. The states labeled P1, P2, P3, P4, P5, P6, and P7correspond to distributions 22A-220N, respectively.

Time After Program (TAP) herein shall refer to the time since a cell hasbeen written and is the primary driver of TVS (temporal voltage shift).TAP can be estimated (e.g., inference from a data state metric), ordirectly measured (e.g., from a controller clock). A cell, block, page,block family, etc. is young (or, comparatively, younger) if it has a(relatively) small TAP and is old (or, comparatively, older) if it has a(relatively) large TAP. A time slice is a duration between two TAPpoints during which a measurement can be made (e.g., perform referencecalibration from 8 to 12 minutes after program). A time slice can bereferenced by its center point (e.g., 10 minutes).

As seen from comparing example charts 210 and 230, which reflect thetime after programming (TAP) of 0 (immediately after programming) andthe TAP of T hours (where T is a number of hours), respectively, theprogram distributions change over time due primarily to slow chargeloss. In order to reduce the read bit error rate, the corresponding readthreshold voltages are adjusted to compensate for the shift in programdistributions, which are shown by dashed vertical lines. In variousembodiments of the disclosure, the temporal voltage shift is selectivelytracked for die groups based on measurements performed at one or morerepresentative dice of the die group. Based on the measurements made onrepresentative dice of a die group that characterize the temporalvoltage shift and operational temperature of the dice of the die group,the read threshold voltage offsets used to read the memory cells for thedice of the die group are updated and are applied to the base readthreshold levels to perform read operations.

FIG. 3 depicts an example voltage boundary table and an example voltageoffset table. The voltage boundary table 310 and the voltage offsettable 320 can be used to determine read level offsets, which are addedto a base read level voltage to read data from memory cells. As the timeafter program increases, the threshold voltages for the distribution ofthe memory cell can change as a result of storage charge loss, asillustrated in the example of FIG. 2 . To determine the appropriate readlevel offset for reading a cell, a measurement of the cell can beperformed to estimate the time after program of the cell based on datastate metrics such as voltages. For example, the level 7 distribution ofthe cell can be measured, and the difference between the measured readlevel (e.g., 100 millivolts) and a reference read level (e.g., 0 volts)can be determined. The difference corresponds to the time after programof the memory cell, and can be used to identify a read level offset toadd to the base read threshold level to perform read operations.

The voltage boundary table 310 can be used to identify a bin thatcontains read offsets for use in reading data from the memory cell. Thebin to be used is the value of the Bin column for which the voltagedifference (between the measured read level and the reference readlevel) corresponds to a voltage range shown in the Boundaries column.For example, if the difference is less than V1, then bin 0 is to beused. If the difference is between V1 and V2, then bin 1 is to be used,and so on. The voltage offsets table can be used to identify the readlevel offsets to be used for the identified bin. For example, if the binto be used is bin 0, then the corresponding one of the offsets shown inthe column labeled “Bin 0” 322 (e.g., V10, V20, . . . V60) is to beadded to the base read offset level (and any other offsets) for each oflevels 1-7 when reading the memory cell. “Bin 0” 322 corresponds to thetime after program of 0 hours shown in FIG. 2 . A column “Bin 5” 324corresponds to the time after program of T hours shown in FIG. 2 , andhas offsets of greater magnitude. Bin numbers less than a threshold agevalue can be referred to as “younger bins” and bin numbers greater thanor equal to the threshold age value can be referred to as “older bins.”For example, if the threshold age value is the bin number 5, then bins0-4 can be referred to as younger bins, and bins 5-7 can be referred toas older bins. As another example, if the threshold age value is the binnumber 4, then bins 0-3 can be referred to as younger bins, and bins 4-7can be referred to as older bins.

As described above, “read level” herein shall refer to a voltageposition. Read levels are numbered in increasing voltage from L1 through2{circumflex over ( )} (number of bits). As an example, for TLC, theread levels would be L1, L2, . . . , L7. “Read level value” herein shallrefer to a voltage or DAC value representing a voltage that that isapplied to the read element (often, the control gate for a NAND cell)for purposes of reading that cell. “Read level offset” herein shallrefer to a component of the equation that determines the read levelvalue. Offsets can be summed (i.e., read level value=offset_a+offset_b+. . . ). By convention, one of the read level offsets can be called theread level base. “Calibration” herein shall refer to altering a readlevel value (possibly by adjusting a read level offset or read levelbase) to better match the ideal read levels for a read or set of reads.

As described above, “bin” (or “voltage bin” or “voltage offset bin”)herein shall refer to a set of read level offsets that are applied to aset of data. The bin offsets are read level offsets that affect the readlevel for block families within the bin. In this context, a bin isusually primarily directed at addressing TVS, but can also be directedat other mechanisms (e.g., temperature coefficient (tempco)miscalibration). An old or older bin is one where the read level offsetsare directed at data that was written at a relatively early time. Ayoung or younger bin is one where the read level offsets are directed atdata written relatively recently. The read level adjustments can beimplemented through either offsets or read retries, or even as anadjustment to the base. Bin selection herein shall refer to the processby which the memory device selects which bin to use for a given read.

FIG. 4A depicts an example graph 400 illustrating the dependency of thethreshold voltage offset on the time after program (i.e., the period oftime elapsed since the block has been programmed, in accordance withsome embodiments of the present disclosure. As schematically illustratedby FIG. 4A, blocks families of the memory device are grouped into bins430A-430N, such that each block family includes one or more blocks thathave been programmed within a specified time window and a specifiedtemperature window. As noted herein above, since the time elapsed afterprogramming and temperature are the main factors affecting the temporalvoltage shift, all blocks and/or partitions within a single blockfamily—are presumed to exhibit similar distributions of thresholdvoltages in memory cells, and thus would require the same voltageoffsets for read operations.

Block families can be created asynchronously with respect to blockprogramming events. In an illustrative example, the memory sub-systemcontroller 115 of FIG. 1 can create a new block family whenever aspecified period of time (e.g., a predetermined number of minutes) haselapsed since creation of the last block family or whenever thereference temperature of memory cells, which is updated at specifiedtime intervals, has changed by more than a specified threshold valuesince creation of the current block family. The memory sub-systemcontroller can maintain an identifier of the active block family, whichis associated with one or more blocks as they are being programmed.

A newly created block family can be associated with bin 0. Then, thememory sub-system controller can periodically perform a calibrationprocess in order to associate each die of every block family with one ofthe predefined voltage bins (bins 0-7 in the illustrative example ofFIG. 4A), which is in turn associated with the voltage offset to beapplied for read operations. The associations of blocks with blockfamilies and block families and dice with voltage bins can be stored inrespective metadata tables maintained by the memory sub-systemcontroller, such as metadata tables described with respect to FIG. 7below.

FIG. 4B schematically illustrates a set of predefined threshold voltagebins, in accordance with embodiments of the present disclosure. Asschematically illustrated by FIG. 4B, the threshold voltage offset graph450 can be subdivided into multiple voltage bins, such that each voltagebin corresponds to a predetermined range of threshold voltage offsets.While the illustrative example of FIG. 4B defines ten voltage bins, inother implementations, various other numbers of voltage bins can beemployed (e.g., 64 bins). The memory sub-system controller can associateeach die of every block family with a voltage bin, based on aperiodically performed calibration process, described in further detailbelow.

FIG. 5 schematically illustrates block family management operationsimplemented by the block family manager component 113 of the memorysub-system controller 115, in accordance with embodiments of the presentdisclosure. As schematically illustrated by FIG. 5 , the block familymanager component 113 can maintain, in a memory variable, an identifier520 of the active block family, which is associated with one or moreblocks of cursors 530A-530K as they are being programmed. “Cursor”herein shall broadly refer to a location on the memory device to whichthe data is being written.

The memory sub-system controller can utilize a power on minutes (POM)clock for tracking the creation times of block families. In someimplementations, a less accurate clock, which continues running when thecontroller is in various low-power states, can be utilized in additionto the POM clock, such that the POM clock is updated based on the lessaccurate clock upon the controller wake-up from the low-power state.

Thus, upon initialization of each block family, block family managercomponent 113 stores the current time 540 in a memory variable as theblock family start time 550. As the blocks are programmed, block familymanager component 113 compares the current time 540 to the block familystart time 550. Responsive to detecting that the difference of thecurrent time 540 and the block family start time 550 is greater than orequal to the specified time period (e.g., a predetermined number ofminutes), block family manager component 113 updates the memory variablestoring the active block family identifier 520 to store the next blockfamily number (e.g., the next sequential integer number), and the memoryvariable storing the block family start time 550 is updated to store thecurrent time 540.

The block family manager 510 also maintains two memory variables forstoring the high and low reference temperatures of a selected die ofeach memory device. Upon initialization of each block family, the hightemperature 560 and the low temperature 570 variable store the value ofthe current temperature of the selected die of the memory device. Inoperation, while the active block family identifier 520 remains thesame, temperature measurements are periodically obtained and comparedwith the stored high temperature 560 and the low temperature 570 values,which are updated accordingly: should the temperature measurement befound to be greater than or equal to the value stored by the hightemperature variable 560, the latter is updated to store thattemperature measurement; conversely, should the temperature measurementbe found to fall below the value stored by the low temperature variable570, the latter is updated to store that temperature measurement.

The block family manager 510 can further periodically compute thedifference between the high temperature 560 and the low temperature 570.Responsive to determining that the difference between the hightemperature 560 and the low temperature 570 is greater than or equal toa specified temperature threshold, the block family manager 510 cancreate a new active block family: the memory variable storing the activeblock family identifier 520 is updated to store the next block familynumber (e.g., the next sequential integer number), the memory variablestoring the block family start time 550 is updated to store the currenttime 540, and the high temperature 560 and the low temperature 570variables are updated to store the value of the current temperature ofthe selected die of the memory device.

At the time of programming a block, block family manager component 113associates the block with the currently active block family. Theassociation of each block with a corresponding block family is reflectedby the block family metadata 710, as described in more detail hereinbelow with reference to FIG. 7 .

As described previously, based on a periodically performed calibrationprocess, the block family manager component 113 associates each die ofevery block family with a voltage bin, which defines a set of thresholdvoltage offsets to be applied to the base voltage read level in order toperform read operations. The calibration process involves performing,with respect to a specified number of randomly selected blocks withinthe block family that is being calibrated, read operations utilizingdifferent threshold voltage offsets, and choosing the threshold voltageoffset that minimizes the error rate of the read operation. Block familymanager 113 determines the particular voltage bin corresponding to thechosen threshold voltage offset and updates metadata for the blockfamily to correspond to the determined voltage bin.

In some embodiments, the frequency at which the memory sub-systemcontroller performs the calibration process for each voltage bin can bebased on an age of the block families associated with the voltage bin.As described previously with respect to FIG. 4A, newly created blockfamilies can be associated with voltage bin 0 and older block familieson the memory device can be associated with subsequently numberedvoltage bins. The temporal voltage shift for block families in a youngervoltage bin is more significant than the temporal voltage shift forblock families associated with an older voltage bin. This is illustratedin FIG. 4B, as the voltage offset for bin 0 shifts at quicker rate thanthe voltage offset for older voltage bins (e.g., voltage bins 9, 8, 7,etc.). Therefore, the memory sub-system controller can perform thecalibration process for block families associated with voltage bin 0 ata higher frequency than for block families associated with voltage bin 9to associate each block family with an appropriate voltage bin.

FIG. 6 schematically illustrates selecting block families forcalibration, in accordance with embodiments of the present disclosure.Three bins are shown, named bin 0, bin 1, and bin 2. Bin 0 includesblock families 610, 612, and 614. Bin 1 includes block families 620,622, and 626. Bin 2 includes block family 628. Due to slow charge loss,the oldest block families in a voltage bin will migrate to the nextvoltage bin before any other block families of the current bin. As such,the memory sub-system controller can limit calibration operations to theoldest block families in a bin (e.g., block family 610 in bin 0 andblock family 620 in bin 1). In some embodiments, the memory sub-systemcontroller can identify the oldest block family in a voltage bin basedon a bin boundary 616 for the bin. A bin boundary 515 can represent aboundary between two adjacent block families that are each associatedwith a different bin. The memory sub-system controller can identify thebin boundary 616 for a particular voltage bin using a block familymetadata table, described in further detail below. The bin boundary 616is between bins 0 and 1. A second bin boundary 624 is between bins 1 and2.

FIG. 7 schematically illustrates example metadata maintained by thememory sub-system controller, in accordance with aspects of the presentdisclosure. In some embodiments, block family manager component 113 canmaintain a block metadata table 710 and a block family metadata table720. In some embodiments, block metadata table 710 and/or block familymetadata table 720 can be stored in memory of the memory sub-system(e.g., at memory device 130, 140, local memory 119, etc.) and can bereferenced by block family manager component 113 to determine a blockfamily associated with a particular block and/or a voltage binassociated with the block family. As illustrated in FIG. 7 , blockmetadata table 710 and block family metadata table 720 can be separatemetadata tables. In other or similar embodiments, block metadata table710 and block family metadata table 720 can be included in a singlemetadata table. Additionally or alternatively, block metadata table 710and block family metadata table 720 can be included in other metadatatables maintained by block family manager component 113, such as asuperblock table, an offset table, etc.

In some embodiments, the block metadata table 710 can be indexed byblock family and each entry of the block metadata table 710 can includean indication of one or more blocks, spanning one or more die, includedin a block family. As illustrated in FIG. 7 , block metadata table 710is indexed by block family and includes an indication of a range ofblocks included in each block family. In other or similar embodiments,the block metadata table 710 can be indexed by block and each entry caninclude an indication of the block family associated with the block.Each entry of block metadata table 710 can also include additional datacorresponding to each block. For example, each entry of block metadatatable 710 can include an indication of a time (e.g., in hours), when theblock was written to the memory device. Additionally or alternatively,each entry can include an indication of a temperature (e.g., in Celsius)when the block was written to the memory device.

Block family table 720 is indexed by the block family number, such thateach record of the block family table 720 specifies, for the blockfamily referenced by the index of record, a set of voltage binsassociated with respective dice of the block family. In other words,each record of the block family table 720 includes a vector, eachelement of which specifies the voltage bin associated with the diereferenced by the index of the vector element (referred to as a binpointer). Although individual dice of a block family can be associatedwith different voltage bins, the block family itself is associated witha particular voltage bin. The block family manager component 113determines the voltage bin associated with a particular block familybased on the bin pointer having the lowest value included in the vectorfor the block family. In an illustrative example, the lowest bin pointervalue of the vector for block family 60 is associated with voltage bin 0(i.e., for die 1). Therefore, block family manager component 113associates block family 60 with voltage bin 0. Similarly, block families61-64 are associated with voltage bin 0 (because the lowest bin pointervalue in each of blocks 61-64 is 0), block family 59 is associated withbin 1, block family 5 is associated with bin 6, and block families 0-4are associated with bin 7. As an example, in response to receiving arequest to read data included in block family 60, block family managercomponent 113 uses the threshold voltage associated with voltage bin 0.

A “Bin 0” label 730 is shown next to block families 60-64 to illustratethat block families 60-64 are associated with (“in”) Bin 0, a “Bin 1”label 731 is shown next to block family 59 to illustrate that blockfamily 59 is in bin 1, a “Bin 6” label 736 is shown next to block family5 to illustrate that block family 5 is in bin 6, and a “Bin 7” label 737is shown next to block families 0-4 to illustrate that block families0-4 are in bin 7.

A bin boundary can represent a boundary between two adjacent blockfamilies that are each associated with a different voltage bin.Therefore, block family manager component 113 can identify a binboundary for a voltage bin based on the bin pointers of the vectorincluded in each record of the block family table 720. Block familymanager component 113 can identify a voltage bin boundary for aparticular voltage bin by identifying the oldest block family (i.e., theblock family least recently created) associated with a vector includingbin pointers for one or more die that correspond to the particularvoltage bin. As illustrated in FIG. 7 , the vectors for block families60-64 include a bin pointer associated with voltage bin 0. Block familymanager component 113 can associate block family 60 with a bin boundary722 for voltage bin 0, as block family 60 is the oldest block family ofblock family table 720 where the bin pointer for die 1 corresponds tovoltage bin 0. Block family manager component 113 can associate blockfamily 5 with bin boundary 724 for voltage bin 6, in accordance withpreviously described embodiments.

As time passes and time after program increases for a particular die,the threshold voltage levels associated with the die can change, and thedie can be re-assigned to an “older” bins that that has a thresholdvoltage level suitable for the time after program associated with thatdie. Block family manager component 113 can perform a calibration scanto update the bin pointers of each block families so that the binpointer of each die points to the bin that correspond to the time afterprogram of that die. Thus, the bin numbers can increase from the newestbin (e.g., 0) to the oldest bin (e.g., bin 7) over time as a result ofthe calibration scan. The calibration scan can be performed for aparticular voltage bin by identifying, using block family table 720, anoldest block family associated with the voltage bin (i.e., the blockfamily associated with the bin boundary). In some embodiments, blockfamily manager component 113 can perform the calibration scan for apredefined number of block families associated with the voltage bin(e.g., 2 block families for each calibration scan). In such embodiments,block family manager component 113 can select, using block family table720, the two oldest block families (e.g., block family 60 and blockfamily 61) associated with the voltage bin. In response to selecting thepredefined number of block families, block family manager component 113can perform the calibration scan, in accordance with embodimentsdescribed herein.

In operation, upon receiving a read command, the memory sub-systemcontroller determines the physical address corresponding to the logicalblock address (LBA) specified by the read command. Components of thephysical address, such as the physical block number and the dieidentifier, are utilized for performing the metadata table walk: first,the block table 710 is used to identify the block family identifiercorresponding to the physical block number; then, the block familyidentifier is used as the index to the family table 720 in order todetermine the voltage bin associated with the block family and the die;finally, the identified voltage bin is used as the index to an offsettable (not illustrated) in order to determine the threshold voltageoffset corresponding to the bin. The memory sub-system controller canthen additively apply the identified threshold voltage offset to thebase voltage read level in order to perform the requested readoperation.

FIG. 8A schematically illustrates example scan metadata generated by amedia-management scan and maintained by the memory sub-systemcontroller, in accordance with aspects of the present disclosure. A scanmetadata table 810 can be generated by the media-management scan duringor subsequent to each scan. The scan metadata table 810 includesmeasurement data, which can be values of data state metrics measured(e.g., read) by the media-management scan. The measurement data can be,for example, voltage offsets for each die of each block family. Atimestamp 840 indicates a time at which the scan metadata table 810 isgenerated or provided (e.g., made available) to the calibration scan.The calibration scan can use the timestamp to determine whether the scanmetadata table 810 is to be used (e.g., if the timestamp 840 is lessthan a threshold timestamp) or not to be used (e.g., if the timestamp840 is greater than a threshold timestamp). The measurement data shownin the scan metadata table 810 include entries for block families 0, 1,2, 5, and 11. Each entry includes voltage offsets for N dice for aparticular block family. For example, the entries for families 0, 1, and2 each contain the offset value −22 for each die. The entry for family 5contains the offset values −22, −19, −22, and −19 for dice 0, 1, N-1,and N, respectively, and the entry for family 11 contains the offsetvalues −17, −16, −17, and −16 for dice 0, 1, N-1, and N, respectively.

The calibration scan can use the offset values in the scan metadatatable 810 to calibrate bin pointers of the block families listed in thescan metadata table 810. For example, the calibration scan can identifythe lowest voltage offset for each block family listed in the scanmetadata table 810, and use the lowest voltage offset to determine thebin to be associated with the block family. The bin may be determinedusing a bin lookup table, such as the voltage boundary table 310, thatmaps ranges of voltages to bin numbers. For example, the voltageboundary table 310 may include an entry indicating that voltages in therange −27 to −21 volts correspond to bin 7. Since families 0, 1, and 2are associated with the voltage −22, families 0, 1, and 2 are in bin 7in this example. The voltage boundary table 310 may also include anentry indicating that voltages between −21 and −18 correspond to bin 6,and voltages between −18 and −15 correspond to bin 5. In this example,family 5 is in bin 6 because the lowest value, −19 volts, is between −21and −18, and family 11 is in bin 5 because the lowest value, −16 volts,is between −18 and −15. The scan metadata table 810 includes entries forblock families in older bins (e.g., bins 5, 6, and 7) because themedia-management scan generates data measurements for older bins.

FIG. 8B schematically illustrates example old blocks metadata 850generated by a calibration scan and maintained by the memory sub-systemcontroller, in accordance with aspects of the present disclosure. Thecalibration scan can identify one or more of the oldest blocks of one ormore bins and store the block identifiers of the identified oldestblocks in the old blocks metadata table 850. The media-management scancan prioritize the old blocks listed in the metadata table 850, e.g., byscanning the old blocks prior to scanning other blocks not listed in theold blocks metadata table 850. Prioritizing the old blocks can reducemiscalibration and bit read errors, because the old blocks are likelyclose to transitioning to another (older) bin, and scanning the oldblocks sooner can reduce the amount of time the old blocks remain in abin for which they have become too old (because the scan has notprocessed them yet).

The old blocks metadata table 850 includes an entry for each of thevoltage bins 0-7. Each entry contains the block numbers of the oldestblock, second oldest block, and third oldest block in the correspondingbin. The oldest block is older than the second oldest block, and thesecond oldest block is older than the third oldest block. For example,for bin 5, the oldest block is block 281, the second oldest block isblock 302, and the third oldest block is block 329. The media managementscan can prioritize these old blocks by scanning block 281 (e.g., assoon as possible) then scanning block 302, and then scanning block 329.To implement the prioritization, the media management scan can add theoldest block (281) to the head of a queue of blocks to be scanned, andadd the second and third oldest blocks (302 and 329) to the second andthird positions in the queue behind the head. The media management scancan scan the oldest blocks of bins 6 and 7 similarly. Alternatively, theold blocks metadata table 850 can include just the oldest block for eachbin, or the oldest N blocks for each bin, for any suitable number N.

The bins in the old blocks metadata table 850 that are older than a binage threshold 852 are labeled “older bins.” The bin age threshold isbetween bins 4 and 5 in the table 850, so bins 5, 6, and 7 are the olderbins. The media-management scan scans the older bins. Thus, the youngerbins (bins 0-4) can be omitted from the old blocks metadata table 850.However, the calibration scan can use the younger bins, so the youngerbins are also shown in the old blocks metadata table 850. The bin agethreshold 852 can be determined, for example, empirically, by performingexperiments on sample workloads of read and write commands, anddetermining which value of the bin age threshold 852 provides the bestperformance for the workloads.

FIG. 9 is a flow diagram of an example method to perform a calibrationscan, in accordance with aspects of the present disclosure. The method900 can be performed by processing logic that can include hardware(e.g., processing device, circuitry, dedicated logic, programmablelogic, microcode, hardware of a device, integrated circuit, etc.),software (e.g., instructions run or executed on a processing device), ora combination thereof. In some embodiments, the method 900 is performedby the block family manager component 113 of FIG. 1 . Although shown ina particular sequence or order, unless otherwise specified, the order ofthe processes can be modified. Thus, the illustrated embodiments shouldbe understood only as examples, and the illustrated processes can beperformed in a different order, and some processes can be performed inparallel. Additionally, one or more processes can be omitted in variousembodiments. Thus, not all processes are required in every embodiment.Other process flows are possible.

FIG. 9 illustrates an example calibration scan that can use data statemetric measurements (“measurement data”) provided to the calibrationscan by, for example, a media-management scan or other process. Themeasurement data provided to the calibration scan can be for older bins,and the calibration scan can perform calibration of block family-to-binassociations for the older bins based on the measurement data withoutperforming the time-consuming measurement operations that gather themeasurement data. For younger bins, the calibration scan can performmeasurement operations and use the resulting measurement data to performcalibration of block family-to-bin associations for the younger bins.

The voltage bins can be categorized as younger bins or older binsaccording to an age threshold criterion. For example, the age thresholdcriterion can specify that bins having numbers less than 5 are youngerbins, and bins having numbers greater than or equal to 5 are older bins.In other words, in an 8-bin memory sub-system, bins 0-4 can becategorized as younger bins, and bins 5-7 can be categorized as olderbins. The bin age threshold 852 that distinguishes younger bins fromolder bins (e.g., bin 4 from bin 5 in this example) can be determinedempirically, as described above, or by other suitable techniques.

The media-management scan can perform measurement operations for eachblock (at a different rate than the calibration scan) as part of itsoperation. The media-management scan can be, for example, a reliabilityscan that performs data integrity checking by measuring data statemetrics for each block to assess the integrity of the block. Themedia-management scan can perform measurement operations for all blocksin the memory sub-system, for example. The measurements of data statemetrics are referred to herein as “data measurements.” By providing thedata measurements for older bins to the calibration scan, embodimentsdisclosed herein can reduce or eliminate redundant data statemeasurements that would previously have been performed by thecalibration scan for the older bins. The calibration scan can then focuson younger bins by performing data state metric measurements for youngerbins more frequently, and using those data state metrics to performcalibration operations for block families associated with the youngerbins. Older bins are suitable for scanning by the media-management scanbecause their associations with block families change less frequentlythan the associations of younger bins, and the media-management scan islikely to scan blocks at a frequency lower than the frequencies used bythe calibration scan for younger blocks, but similar to or greater thanthe frequencies used by the calibration scan for older blocks.

At operation 902, the processing device identifies one or more firstvoltage offset bins of the memory device, wherein each of the firstvoltage offset bins satisfies a first age threshold criterion. Atoperation 904, the processing device identifies one or more secondvoltage offset bins of the memory device, wherein each of the secondvoltage offset bins satisfies a second age threshold criterion. Atoperation 906, the processing device identifies a first block familyassociated with one of the first voltage offset bins. The first andsecond age threshold criteria, like other thresholds described herein,can be determined empirically, e.g., by performing experiments on sampleworkloads of read and write commands, and determining which value ofeach threshold provides the best performance for the workloads.

At operation 908, the processing device begins to perform a first scanof the first block family by performing the following operations. Thefirst scan can be, for example, a block family calibration scan. Atoperation 910, the processing device determines one or more values ofthe first data state metric based on the first block. As an example, thevalues of the first data state metric can characterize a voltage shiftassociated with the first block. The values of the first data statemetric can be determined based on measurement of one or more voltages ofthe first block. The first block can be the oldest block in the firstblock family, and the measurement can be performed at a specifiedvoltage distribution, such as the 7th distribution (e.g., in the valleyof the specified voltage distribution).

At operation 912, the processing device identifies, based on one or morevalues of the first data state metric, a first identified voltage offsetbin. The memory sub-system 110 can store and access a voltage boundarytable 310 that comprises a set of voltage boundaries that correspond tovoltage ranges and associates each of the voltage boundaries or rangeswith a corresponding voltage offset bin. The first voltage offset bincan then be identified by determining that the voltage shift associatedwith the first block is within a particular one of the plurality ofvoltage ranges. The voltage boundary table 310 can map the particularone of the plurality of voltage ranges to the identified first voltageoffset bin, for example.

As described above, the calibration scan can use the measurement dataobtained by the media-management scan to perform calibration operations,such as updating bin pointers, for block families associated with theolder bins. The calibration scan can access and use the measurement datafrom the media-management scan as the measurement data becomes availableto the calibration scan (e.g., in response to the media-management scansending, or otherwise making available, the measurement data to thecalibration scan). The calibration scan can use the measurement data asit becomes available if, for example, the calibration scan is otherwiseidle (e.g., not performing calibration operations) at the time themeasurement data is provided to the calibration scan. Alternatively oradditionally, the media-management scan can store the measurement datain a scan metadata table 810, e.g., in association with the blockfamilies and dice to which the measurement data corresponds, and thecalibration scan can retrieve the measurement data from the scanmetadata table 810 as needed (e.g., when the calibration scan processesthe block families to which the measurement data corresponds).

The measurement data can include data state metrics for blocks in blockfamilies that are associated with older voltage bins. The calibrationscan can use the data state metrics to update the assignments of bins toblock families. For example, for each block family that is categorizedas being in an older bin, the media-management scan can measure andstore a value of a data state metric, such as a voltage, for each die ofa memory device. The media-management scan can store the measured valuesin a block family metadata table 720 or other suitable data structure.The calibration scan can access the stored values of the data statemetric when it scans each block family, and use the stored values toupdate the assignments of bins to the block family. For example, whenscanning a block family, the calibration scan can, for each die,determine which bin the block family should be assigned to for that diebased on the stored value of the data state metric for that block familyand die, and update the assignments accordingly. Updating theassignments can result in one or more of the bin pointers associatedwith the block family being incremented by one (or more), therebyassigning the block family to an older bin (for the one or more dice).As an example, the calibration scan can increment the bin pointer ofblock family 5 for die 1 from 6 to 7 if the measurement data for blockfamily 5, die 1 is within a range that corresponds to bin 7 according toa voltage boundary table 310, as described above with reference to FIG.8A.

At operation 914, the processing device identifies one or more values ofa second data state metric in scan metadata generated by a second scan,wherein the one or more values of the second data state metric areassociated with the second block family. The second scan can be, forexample, and media management scan. The scan metadata can be, forexample, the scan metadata table 810. In circumstances where thecalibration scan selects a block family associated with an older bin,but the second scan has not yet provided measurement data for that blockfamily (e.g., values of the second data state metric in the scanmetadata), then the calibration scan can perform the measurementoperations to obtain the measurement data for that block family. Suchcircumstances are expected to be rare, because older bins are scanned ata substantially slower rate than younger bins, as described above.

At operation 916, the processing device identifies, based on the one ormore values of the second data state metric, a second identified voltageoffset bin. The second identified voltage offset bin can be identifiedusing the voltage boundary table 310, similarly to how the firstidentified voltage offset bin is identified (as described above withrespect to operation 912), for example. At operation 918, the processingdevice associates the first block family with the first identifiedvoltage offset bin and the second block family with the secondidentified voltage offset bin. The processing device can associate thefirst block family with the identified voltage offset bin byidentifying, at a block family metadata table 720, a particular entrycorresponding to the first block family, and updating the particularentry to include a reference to the identified voltage offset bin.

FIG. 10 is a flow diagram of an example method to perform amedia-management scan, in accordance with aspects of the presentdisclosure. The method 1000 can be performed by processing logic thatcan include hardware (e.g., processing device, circuitry, dedicatedlogic, programmable logic, microcode, hardware of a device, integratedcircuit, etc.), software (e.g., instructions run or executed on aprocessing device), or a combination thereof. In some embodiments, themethod 1000 is performed by the block family manager component 113 ofFIG. 1 . Although shown in a particular sequence or order, unlessotherwise specified, the order of the processes can be modified. Thus,the illustrated embodiments should be understood only as examples, andthe illustrated processes can be performed in a different order, andsome processes can be performed in parallel. Additionally, one or moreprocesses can be omitted in various embodiments. Thus, not all processesare required in every embodiment. Other process flows are possible.

At operation 1002, the processing device performs a second scan of aplurality of second blocks of a memory device by, for each of the secondblocks, performing the following operations. The first and second scanscan performed concurrently, e.g., as processes that are executedconcurrently by the memory system controller 115. The first scan canprocess the first voltage offset bins more frequently than the secondvoltage offset bins, and the second scan can process the first voltageoffset bins at the same frequency as the second voltage offset bins. Thefirst scan can be, for example, a calibration scan. The second scan canbe a media management scan, and the media management scan can, for eachof the second blocks, perform a media management operation based on theone or more values of the second data state metric associated with thesecond block. The media management operation can include data integritychecking, garbage collection, folding, or wear leveling, for example.For example, a media-management scan that performs data integritychecking is referred to herein as a “reliability scan.”

The media-management scan can prioritize blocks from block families thatare near a transition to a different bin, so that measurement data forthe blocks can be obtained and made available to the calibration scansooner than measurement data for other blocks. Priority is given toblock families near a bin transition by updating the bin pointersassociated with these block families (which can be performed by thecalibration scan) as soon as possible after the temporal voltage shiftof memory cells moves from a range associated with thecurrently-assigned bin to a range associated with an older bin. Updatingthe bin pointer to point to the older bin as soon as possible can avoidmiscalibration of the read levels of the current bin with the temporalvoltage cell of the cells in which the block is stored. Miscalibrationcan lead to higher bit error counts, for example. The media-managementscan can prioritize blocks by, for example, scanning them prior toscanning other blocks. The calibration scan can identify the blocks tobe prioritized and include them in a list or table of prioritizedblocks. The list or table of prioritized blocks can be provided to themedia-management scan, which can scan the blocks in the list ofprioritized blocks prior to scanning blocks not in the list. Theprioritized blocks can be blocks that are associated with block familiesthat are in older bins, since the media-management scan need not providedata measurements for younger bins to the calibration scan. Theprioritized blocks can be the oldest blocks in each of the older bins(e.g., up to a threshold number of the oldest blocks in each of theolder bins). The calibration scan can obtain a list of the oldest blocksfrom the block family manager component 113, which can keep track of theblock families sequentially by age within each bin (e.g., as shown inFIG. 7 ). The media scan can also obtain information about the oldestblocks per bin from the block family manager component 113 to identifythe oldest blocks per bin. The memory sub-system can keep track of thepower-on minute or hour at which the block was created or written to(and sorted into an ordering by age). The list or table of prioritizedblocks can be an old blocks metadata table 850 as shown in FIG. 8B, forexample.

To identify prioritized blocks that are near bin transitions, theprocessing device can access the old blocks metadata table 850. The oldblocks metadata table 850 includes one or more entries. Each entrycorresponds to a bin and specifies an oldest block of the bin. For eacholdest block specified by each entry in the old blocks metadata table850, the processing device can perform the second scan for the oldestblock prior to performing the second scan for other (second) blocks thatare not specified as an oldest block in the old blocks metadata table850.

The media-management scan can use different bin selection criteria thanthe calibration scan. Since the media-management scan can process blocksat a slower rate than the calibration scan, the media-management scancan obtain more measurement data for each block than the calibrationscan. For example, the calibration scan ordinarily obtains measurementdata by measuring a data state metric (e.g., a read level voltage) thatis a position metric of a single specified voltage distribution (e.g., alevel 7 distribution), since the additional time involved in measuringmore than one of the voltage distributions would reduce the scanfrequency. However, the media-management scan has time available toperform additional measurements. Thus the media-management scan canmeasure two or more of the voltage distributions, such as a level 1distribution and the level 7 distribution, to obtain two read levelvoltages. Measuring more voltage distributions ordinarily increases theaccuracy of the measurement. For example, reading a level 7 distributionprovides a single read level voltage, which can be used to determine achange in read level voltage at a particular time since program.However, reading the level 1 and 7 distributions provides two values,which can be used to determine changes in read level voltages of levels1 and 7, respectively.

The value of the change in read level voltage can be used, e.g., with atemporal voltage shift (TVS) function that describes how the data statemetric varies as a function of time after program, to determine a timeafter program of the block being scanned. The time after program cansubsequently be used to determine which bin to associate with the block.Interpolation of levels 1 through 6 based on a value of the level 7distribution can be imprecise, however. Further, a single measured valueis susceptible to noise that can distort the measured value. Themedia-management scan can measure multiple values because it operates ata slower rate than the calibration scan. Further, the media-managementscan can use multiple measured values (e.g., of all 8 distributions) todetermine the reliability of the block, and can provide these values foruse by the calibration scan. Thus, for example, the media-managementscan can provide measured values of the level 1 and level 7distributions to the calibration scan. The calibration scan can use thelevel 1 value to interpolate levels 1-3, and can use the level 7 valueto interpolate levels 4-6. Thus, the availability of the additionalmeasured value can increase the accuracy of the measurements of thelevels 1-3 distributions provided to the calibration scan.

At operation 1004, the processing device identifies a second blockfamily associated with the second block, e.g., using the block metadatatable 710. At operation 1006, the processing device determines whetherthe second block family is associated with one of the second voltagebins, e.g., using the block family metadata table 720.

At operation 1008, if the second block family is associated with one ofthe second voltage bins, the processing device performs operation 1010.If the second block family is not associated with one of the secondvoltage bins, the method 1000 ends. At operation 1010, the processingdevice determines one or more values of a second data state metric basedon the second block. The values of the second data state metric cancharacterize a voltage shift associated with the second block. Thesecond data state metric can include a first position metric of a firstspecified voltage distribution and a second position metric of a secondspecified voltage distribution. The values of the second data statemetric can be determined by measuring the first and second specifiedvoltage distributions within one or more memory cells of the secondblock for example. The values of the second data state metric can bedetermined based on the second block by determining that the blockfamily metadata table 720 does not include an indication that the secondblock has been processed. Further, the block family metadata table 720can be updated to include an indication that the second block has beenprocessed.

At operation 1012, the processing device updates the scan metadata table810 to include an entry associating the second block family with the oneor more values of the second data state metric. The processing devicecan provide the scan metadata table 810 to the first scan, and can alsoprovide a timestamp 840 to the first scan, the timestamp indicating atime at which the scan metadata table 810 is provided or generated.

FIG. 11A schematically illustrates an example calibration scan performedby the memory sub-system controller based on a particular scanfrequency, in accordance with aspects of the present disclosure. Thecalibration scan is illustrated as a sequence of operations that occurover time. Each operation is shown as vertical line, and time increasesas shown by the arrow pointing to the right. The illustrated operationsinclude read operations 1102 (each shown as one of the shortest verticallines), write operations 1104 (each shown as one of the medium-lengthvertical lines), and scan iterations 1106 (each shown as one of thetallest vertical lines). Each read operation 1102 or each writeoperation 1104 can be performed by the memory sub-system in response toa respective read or write request from the host. A calibration scaniteration 1106 can include execution of a calibration process thatcalibrates at least a portion of the block families in the memorysub-system. The calibration scan process can be executed repeatedly atdifferent times (e.g., periodically, or in response to particularcriteria being satisfied). For example, in each calibration scaniteration, one or more bins can be calibrated. The lines in FIGS.11A-11D are not necessarily shown to scale. Each line can be understoodas representing the execution of the represented operation, or at leastan initiation of execution of the represented operation. The representedoperation can finish during the time between the left and right sides ofthe line, but does not necessarily finish within that time, for example.

Each of the scan iterations 1106 a-i can represent an invocation of acalibration scan process that calibrates block families withcorresponding bins by updating bin pointers of the block families. Thescan iterations 1106 a-i are initiated at a particular frequency, e.g.,one scan iteration per minute or other suitable frequency. Thus, forexample, there can be a time delay between the initiation of iteration1106 a and the initiation of iteration 1106 b, and so on. The time delaycan correspond to the scan frequency, and can be one millisecond, 10milliseconds, 1 second, 1 minute, or other suitable amount of time. Asshown by the read operations 1102 and write operations 1104, one or moreread and/or write operations can be performed by the memory sub-systembetween consecutive scan iterations 1106.

Each calibration scan iteration 1106 can scan one or more selected bins,e.g., by performing each calibration scan iteration 1106 for blockfamilies associated with a selected set of bins. The selected set ofbins can be different for different scan iterations 1105. For example,younger bins, such as Bin1, can be scanned more frequently than olderbins. Thus, as shown, Bin1 is scanned in each scan iteration 1106 a-i.Bin2, which is older than Bin1, is scanned in every other scan iteration1106 b, 1106 d, 1106 f, 1106 h. Bin3, which is older than Bin1, isscanned in every eighth scan iteration 1106 i. Older bins, such as Bin4through Bin8, are scanned less frequently, in accordance with theirassociated age rankings (e.g., Bin1 can be ranked first in terms of age,and Bin8 is can be ranked 8th in terms of age). As can be seen, scaniterations of Bin1 are initiated at a first frequency, while scaniterations of Bin2 are initiated at a second frequency, which is lowerthan the first frequency. Thus, a calibration scan process can beinvoked at the first frequency, but perform a scan of one bin (e.g.,Bin1) at the first frequency, and perform a scan of another bin (e.g.,Bin2) at a lower second frequency by scanning Bin2 in a subset of theinvocations (e.g., every other invocation, which corresponds to a secondfrequency that is half the first frequency). Although the frequency ofscan iterations 1106 is different for each bin, the frequency of eachindividual bin does not vary in the example of FIG. 11A. For example, inFIG. 11A, Bin1 is scanned at a first constant frequency, and Bin2 isscanned at a second constant frequency that is half the first constantfrequency.

In particular embodiments, a block family manager component 113 canadjust a calibration scan to adapt to changing characteristics of thememory sub-system. These characteristics of the memory sub-system arereferred to herein as calibration demand characteristics, and caninclude system state characteristics, such as the power state of thememory sub-system or host, as well as characteristics of data beingstored, such as workload characteristics. The calibration demandcharacteristics can include block family age, power state of the memorysub-system or host (e.g., active, idle, hibernating, or sleeping),workload size, program/erase cycle counts of the memory device, or readerror rates. The calibration scan can be adjusted to adapt to changingcalibration demand characteristics by, for example, changing thefrequency of scan iterations 1106 in accordance with the calibrationdemand characteristics. These calibration scan adjustments can beperformed in response to detecting that threshold calibration demandcriteria are satisfied by the calibration demand characteristics. Forexample, a set of calibration demand ranges, such as ranges of readerror rates, can be associated with a set of corresponding calibrationscan adjustment parameters that specify how to adjust the calibrationscan for each of the ranges. Alternatively, other suitable functionsthat map calibration demand to calibration scan adjustment parameterscan be used, such as linear or quadratic functions. As such, thecalibration scan can be adjusted based on the characteristics associatedwith the memory sub-system by modifying scan parameters such as the scanfrequency or other criteria that control the initiation of calibrationscans.

As an example, the calibration scan can be adjusted based on blockfamily age by selecting a subset of the block families and scanning thesubset of block families, but not other block families not in thesubset. The subset of block families to be scanned can be, for example,the oldest block family in each voltage bin, as described with referenceto FIGS. 12A and 12B below. The frequency of calibration scan iterations1106 can be adjusted based on memory sub-system characteristics, asdescribed below with reference to FIG. 12B. The block families to beprioritized in calibration scans can be determined based on read errorrates associated with the block families, as described with reference toFIG. 12C below. Prioritizing block families based on read error ratescan involve scanning block families associated with read error ratesgreater than a threshold error rate prior to scanning other blockfamilies having error rates below the threshold.

The frequency of calibration scan iterations 1106 can be adjusted basedon memory sub-system characteristics such as a power state, as describedbelow with reference to FIG. 11B. The frequency of calibration scaniterations 1106 can also be adjusted in proportion to a workloadintensity, such as the number of write operations performed by thememory sub-system over a period of time, as described below withreference to FIGS. 11C and 12D, or in proportion to a number ofprogram/erase cycles associated with a memory device, as described belowwith reference to FIGS. 11D and 12D.

FIG. 11B schematically illustrates an example calibration scan,performed by the memory sub-system controller, in which the scanfrequency varies based on power state, in accordance with aspects of thepresent disclosure. In an idle state, the memory sub-system can devote asubstantial amount of processing time to performing scan operationsbecause few if any host reads or writes are expected to occur. Thus, inresponse to a transition from an active state 1114 to an idle state1116, the block family manager component 113 can increase the scanfrequency, as shown by scan iterations 1108. For example, in the idlestate 1116, the scan frequency can be increased to a higher frequencythat corresponds minimal delay (e.g., no delay, or less than a thresholdamount of delay) between scan iterations 1108. As another example, inthe idle state 1116, the scan frequency can be increased to a higherfrequency in response to the memory sub-system or host system being inan idle power state for at least a threshold amount of time. Further, inan idle power state, the block family manager component 113 can invokethe scan process repeatedly until all bins have been calibrated, or thepower state changes. As another example, the calibration scan, or aniteration of the calibration scan, can be performed in response todetermining that at least one threshold criterion is satisfied. Thethreshold criterion can be based on a power state of the memorysub-system or host system. For example, the threshold criterion can besatisfied when the memory sub-system is in an idle power state, or whenthe memory sub-system transitions to an active power state from a sleepstate or low power state.

In a low power state 1118 (e.g., hibernation), the block family managercomponent 113 can cause the memory sub-system to perform scan iterations1110 at a lower scan frequency. For example, the memory sub-systemcontroller can wake up (e.g., transition from low power to idletemporarily) at the lower frequency and scan a number of blocks or pagesspecified by a parameter. The reduced scan frequency can be based on ascan period parameter, e.g., a number of milliseconds between wake ups.The periodic wake up and scan can continue until a scan instance iscomplete, e.g., until all block family bin pointers in one or more blockfamilies have been calibrated. Subsequent to the low power state 1118,the memory sub-system and/or host can transition to an active state 1120in which scan iterations can continue at the frequency used in theprevious active state 1114.

As another example, a calibration scan can be performed in response to atransition of the host's power state from a sleep state to anotherstate, such as a low power (e.g., hibernation) state, or from a sleepstate or low power state to an active power state. The calibration scancan be performed on the oldest block bins to determine whether theassignment of block families to the oldest block bins are to be changed(e.g., based on data state metrics). If the assignment to the oldestblock families is not to be changed, neither have the assignments ofother bins to other (younger) block families. Thus, the calibration scancan stop after scanning the oldest block bins.

FIG. 11C schematically illustrates an example calibration scan,performed by the memory sub-system controller, in which the scanfrequency varies based on workload, in accordance with aspects of thepresent disclosure. When writes are occurring frequently, a relativelylarge amount of new data is likely to be written, which means that newdata is entering bin 1. Thus, the calibration scan should be performedat a relatively high frequency to calibrate the new data. When writesare not occurring, e.g., because the host has stopped writing data for aperiod of time, the existing data continues aging. As the time after thelast write (e.g., program) of data increases, the data moves to olderbins, and, as the time continues to increase, does not move between binsas quickly. Thus, the scan frequency can be reduced after a thresholdamount of time has passed since the most recent write.

As shown in FIG. 11C, prior to a time T1, the block family managercomponent 113 initiates scan iterations 1126 a, 1126 b, and 1126 c at abaseline scan frequency (e.g., one iteration per minute). At time T1, amost recent write 1124 has occurred. No subsequent writes occur untiltime T3, at which time a subsequent write 1128 occurs. The block familymanager component 113 can decrease the scan frequency in response todetecting that no writes have been performed for at least a thresholdperiod of time, which is shown as a threshold period without writesbetween times T1 and T2. At time T2, the block family manager component113 detects that the threshold period without writes has elapsed, andchanges the scan frequency to a reduced scan frequency, such as apredetermined reduced value (e.g., one iteration every three minutes),or a predetermined fraction of the baseline scan frequency (e.g., ⅓).The scan frequency can remain at the reduced value, as shown by scaniterations 1126 d and 1126 e, until a subsequent write 1128 is detectedat time T3. In response to detecting the subsequent write 1128, theblock family manager component 113 changes the scan frequency back tothe baseline scan frequency, since writes are occurring more frequentlyand bin assignments are more likely to change.

FIG. 11D schematically illustrates an example calibration scan,performed by the memory sub-system controller, in which the scanfrequency is increased based on program/erase count, in accordance withaspects of the present disclosure. The age of a memory device can beapproximated by a number of program/erase cycles (PEC) that the devicehas performed. When the memory device is young, e.g., less than 100 PEC,the temporal voltage shift is relatively small. As the device ages,e.g., after 100 PEC, the temporal voltage shift increases. As the devicecontinues to age, e.g., after 1000 PEC, the temporal voltage increasesstill further. Thus, the scan frequency can be set to a relatively lowbaseline value, e.g., 1 scan iteration every 10 seconds, when the devicehas less than a first threshold number of PEC, such as 100 PEC. The scanfrequency can be set to a first increased value, e.g., 1 scan iterationevery 5 seconds, when the device has between a first threshold numberTh1 of PEC (e.g., 100) and a second threshold number Th2 of PEC (e.g.,1000). The scan frequency can be set to a second increased value, e.g.,1 scan iteration every second, when the device has between the secondthreshold number Th2 of PEC (e.g., 1000) and a third threshold numberTh3 of PEC (e.g., 10,000), and so on.

As shown in FIG. 11D, scan iterations 1136 a, 1136 b, and 1136 c areperformed at a baseline scan frequency when the PEC is less than thefirst threshold. At time T4, the PEC becomes greater than the firstthreshold. Subsequent to time T4, the block family manager component 113detects that the PEC is between the first and second thresholds, andsets the scan frequency to the first increased value. The scaniterations between iteration 1136 c and iteration 1136 d occur at thefirst increased scan frequency. At time T5, the PEC becomes greater thanthe second threshold. Subsequent to time T5, the block family managercomponent 113 detects that the PEC is between the second and thirdthresholds, and sets the scan frequency to the second increased value.Scan iterations subsequent to iteration 1136 d occur at the secondincreased scan frequency.

FIG. 12A is a flow diagram of an example method 1200 to perform acalibration scan in which the oldest block family within each bin isscanned, in accordance with aspects of the present disclosure. Thecalibration scan can be adjusted to prioritize blocks that are likely tobecome mis-calibrated in the near future. Block families can beprioritized based on their age by selecting a subset of the blockfamilies that are older than other block families, and scanning thesubset of block families, but not other block families. The subset ofblock families to be scanned can be, for example, the oldest blockfamily in each voltage bin, or the oldest N block families in eachvoltage bin, where N is a threshold number of oldest block families.

The method 1200 can be performed by processing logic that can includehardware (e.g., processing device, circuitry, dedicated logic,programmable logic, microcode, hardware of a device, integrated circuit,etc.), software (e.g., instructions run or executed on a processingdevice), or a combination thereof. In some embodiments, the method 1200is performed by the block family manager component 113 of FIG. 1 .Although shown in a particular sequence or order, unless otherwisespecified, the order of the processes can be modified. Thus, theillustrated embodiments should be understood only as examples, and theillustrated processes can be performed in a different order, and someprocesses can be performed in parallel. Additionally, one or moreprocesses can be omitted in various embodiments. Thus, not all processesare required in every embodiment. Other process flows are possible.

At operation 1202, the processing device performs a block familycalibration scan of the memory device, wherein the calibration scancomprises a plurality of scan iterations. Each scan iteration isinitiated in accordance with at least one threshold scan criterion, andwherein each scan iteration comprises the following operations. Atoperation 1204, the processing device identifies at least one firstvoltage bin, wherein each first voltage bin is associated with aplurality of read level offsets. At operation 1206, the processingdevice identifies, according to a block family creation order, an oldestblock family from a plurality of block families associated with thefirst voltage bin. As shown in FIG. 7 , an oldest block familyassociated with a bin can be identified by searching the block familymetadata table 720 for the lowest block family number associated withthe bin. As described above with respect to FIG. 7 , in the exampleblock family metadata table 720, bin 0 is associated with block families60-64, bin 1 is associated with block family 59, bin 6 is associatedwith block family 5, and bin 7 is associated with block families 0-4. Inthe block family metadata table 720, the oldest block family associatedwith bin 0 is block family 60, since the block family number (60) is thelowest of the block family numbers associated with bin 0. Similarly, theoldest block family associated with bin 1 is block family 59, the oldestblock family associated with bin 6 is block family 5, and the oldestblock family associated with bin 7 is block family 0.

At operation 1208, the processing device performs a block familycalibration by updating at least one bin pointer of the oldest blockfamily in view of a data state metric of at least one block of theoldest block family.

FIG. 12B is a flow diagram of an example method 1210 to perform acalibration scan in which the block family prioritization andcalibration scan frequency can vary based on characteristics associatedwith the memory sub-system, in accordance with aspects of the presentdisclosure. The method 1210 can prioritize selected block families andadjust the calibration scan frequency.

The method 1210 can be performed by processing logic that can includehardware (e.g., processing device, circuitry, dedicated logic,programmable logic, microcode, hardware of a device, integrated circuit,etc.), software (e.g., instructions run or executed on a processingdevice), or a combination thereof. In some embodiments, the method 1210is performed by the block family manager component 113 of FIG. 1 .Although shown in a particular sequence or order, unless otherwisespecified, the order of the processes can be modified. Thus, theillustrated embodiments should be understood only as examples, and theillustrated processes can be performed in a different order, and someprocesses can be performed in parallel. Additionally, one or moreprocesses can be omitted in various embodiments. Thus, not all processesare required in every embodiment. Other process flows are possible.

At operation 1212, the processing device determines a calibration scanfrequency. At operation 1214, the processing device evaluates one ormore scan criteria, and determines whether the scan criteria aresatisfied. The scan criteria can be, for example, that the currentsystem time has exceeded the time at which the next scan iteration is tobe performed according to the current scan frequency. Other scancriteria are possible. For example, the scan criteria can be that thepower state of the memory sub-system and/or the host has transitioned.The transition can be to an idle state, for example, in which case thescan frequency can be increased, as described above. The transition canbe to a low power (e.g., hibernate) state as another example, in whichcase the scan frequency can be decreased and a wake up can be scheduledto occur to perform the next scan iteration at a time based on thedecreased scan frequency.

At operation 1216, if the scan criteria are not satisfied, theprocessing device waits, e.g., for a period of time, or until acondition is detected, such as a change in a condition that affects thescan criteria (e.g., a host read or write operation). After waiting, theprocessing device performs operation 1214 again to determine whether thescan criteria are satisfied.

If operation 1214 determines that the scan criteria are satisfied, thenat operation 1218, the processing device generates or updates a list ofblock families or blocks to scan. The block families to scan can be,e.g., the oldest block families in each bin. The blocks can beindividual blocks, in which case the blocks can be mapped to theirassociated block families. The list can be generated by the method ofFIG. 12C. At operation 1220, the processing device performs a scaniteration by calibrating one or more bin pointers of each block familyidentified by the list. If the list contains a block, the block familyassociated with the block can be calibrated. To calibrate a blockfamily, the processing device can update one or more bin pointers of theblock family in view of a data state metric determined by a measurementof at least one of the block family's blocks (e.g., measurement of amemory cell that stores a portion of the block). At operation 1222, theprocessing device determines an updated calibration scan frequency. Theupdated calibration scan frequency can be determined by the method ofFIG. 12D. Operation 1222 causes the processing device to invokeoperation 1214 to prepare to perform another scan iteration. The nextscan iteration can use the list of families or blocks generated atoperation 1218 and the updated calibration scan frequency determined atoperation 1222.

FIG. 12C is a flow diagram of an example method 1230 to determine blockfamily prioritization based on characteristics associated with thememory sub-system, in accordance with aspects of the present disclosure.The block families to be prioritized in calibration scans can bedetermined based on read error rates associated with the block families,for example. Prioritizing block families based on read error rates cancause the block family manager component 113 to scan block familiesassociated with read error rates greater than a threshold error rateprior to scanning other block families having error rates below thethreshold. Alternatively, a priority can be associated with each blockfamily based on a function of the block family's error rate, and theblock families can be scanned in order according to their priorities.

The method 1230 can be performed by processing logic that can includehardware (e.g., processing device, circuitry, dedicated logic,programmable logic, microcode, hardware of a device, integrated circuit,etc.), software (e.g., instructions run or executed on a processingdevice), or a combination thereof. In some embodiments, the method 1230is performed by the block family manager component 113 of FIG. 1 .Although shown in a particular sequence or order, unless otherwisespecified, the order of the processes can be modified. Thus, theillustrated embodiments should be understood only as examples, and theillustrated processes can be performed in a different order, and someprocesses can be performed in parallel. Additionally, one or moreprocesses can be omitted in various embodiments. Thus, not all processesare required in every embodiment. Other process flows are possible.

At operation 1232, the processing device, identifies the oldest blockfamily in the voltage bin, e.g., as described above with respect to FIG.12A. At operation 1234, the processing device identifies one or moreblock families or blocks having error rates greater than a thresholderror rate. The error rates can be read error rates, which theprocessing device can determine as a ratio of the number of reads thathave caused an error handler to be executed to a total number of readsperformed by the memory sub-system. Alternatively or additionally, theerror rates can be raw bit error rates (RBER), which can be determinedbased on information collected when data is read from a block family (orblock). The information can include mean, median, maximum, and minimumnumber of bits in error per block family (or block). If the read errorrate associate with a block family or block is greater than a read errorrate threshold, or the raw bit error rate is greater than a raw biterror rate threshold, then the associated block family (or block) can beprioritized for recalibration, e.g., by adding the associated blockfamily (or block) to the list. At operation 1236, the processing deviceadds each of the identified block families or blocks to the list ofblock families or blocks to scan.

FIG. 12D is a flow diagram of an example method 1240 to determinecalibration scan frequency based on characteristics associated with thememory sub-system, in accordance with aspects of the present disclosure.The frequency of calibration scan iterations 1106 can be adjusted inproportion to a workload intensity, such as the number of writeoperations performed by the memory sub-system over a period of time, orin proportion to a number of program/erase cycles associated with amemory device.

The method 1240 can be performed by processing logic that can includehardware (e.g., processing device, circuitry, dedicated logic,programmable logic, microcode, hardware of a device, integrated circuit,etc.), software (e.g., instructions run or executed on a processingdevice), or a combination thereof. In some embodiments, the method 1240is performed by the block family manager component 113 of FIG. 1 .Although shown in a particular sequence or order, unless otherwisespecified, the order of the processes can be modified. Thus, theillustrated embodiments should be understood only as examples, and theillustrated processes can be performed in a different order, and someprocesses can be performed in parallel. Additionally, one or moreprocesses can be omitted in various embodiments. Thus, not all processesare required in every embodiment. Other process flows are possible.

At operation 1242, the processing device determines whether a writeoperation has occurred between the current (system) time and a thresholdtime in the past. If so, then the processing device performs operation1246. If a write operation has not occurred between the current time andthe threshold time in the past, then at operation 1244, the processingdevice decreases the scan frequency by a determined amount, e.g., asdescribed above with respect to FIG. 11C. Operation 1244 then causes theprocessing device to perform operation 1250.

At operation 1246, the processing device determines whether the scanfrequency is at a decreased level as a result of a previous execution ofoperation 1244. If so, at operation 1248, the processing deviceincreases the scan frequency by the determined amount. Alternatively,the processing device can set the scan frequency to a baseline frequencyif operation 1244 reduced the scan frequency from the baselinefrequency. If the scan frequency is not at a decreased level as a resultof operation 1244, then the processing device performs operation 1250.At operation 1250, the processing device determines whether the numberof program/erase cycles (PEC) associated with the memory sub-system isgreater than a PEC threshold. If so, at operation 1252, the processingdevice increases the calibration scan frequency based on the number ofprogram/erase cycles. If the number of program/erase cycles (PEC)associated with the memory sub-system is not greater than a PECthreshold, then the calibration scan frequency is not changed.

FIG. 13 schematically illustrates calibration scans, performed by thememory sub-system controller, in which the scan frequency varies basedon power state, in accordance with aspects of the present disclosure.Each calibration scan is illustrated as a sequence of operations thatoccur over time. Each operation is shown as vertical line, and timeincreases as shown by the arrow pointing to the right. FIG. 13 showsoperations that occur during an active state 1310, an idle state 1314, alow power (hibernation) state 1318, a deferral state 1324 that occurs asa result of a sleep request 1320, and a sleep state 1326 that alsooccurs as a result of the sleep request 1320 and is subsequent to thedeferral state 1324. The active state 1310 is similar to the activestate described above with respect to FIG. 11A. The active state 1310corresponds to a time period in which a memory sub-system (or hostsystem) is in an active (e.g., awake) power state. The times at whichparticular example operations are initiated are represented by readoperations 1302 (each shown as one of the shorted vertical lines), writeoperations 1304 (each shown as one of the medium-length vertical lines),and scan iterations 1306 (each shown as one of the tallest verticallines). “Calibration scan iteration” herein can refer to an execution ofa calibration process that calibrates at least a portion of the blockfamilies in the memory sub-system. The calibration process can beexecuted repeatedly at different times (e.g., periodically, or inresponse to particular criteria being satisfied). Adjusting thecalibration scan frequency can include performing calibration scaniterations in response to certain criteria, such as a power statetransition to an idle state from a hibernating state. A calibration scaniteration can be initiated in response to the memory sub-systemtransitioning to a higher power state from a lower power state, such toan active state from a low-power state, such as a sleep or hibernatestate.

Each read operation 1302 or each write operation 1304 can be performedby the memory sub-system in response to a respective read or writerequest from the host. A calibration scan iteration can be execution ofa calibration process that can be executed repeatedly at different times(e.g., periodically, or in response to particular criteria beingsatisfied). Further details of the active state, the read operation1302, write operation 1304, and calibration scan iterations 1306 areprovided above with respect to FIG. 11A.

The power state of the system can correspond to a level of activityoccurring in the system, and to an amount of power that can be consumedby the system. The power state of the system at a particular time can beone of a set of supported states, such as active, idle, low power(hibernate), or deep sleep. The power state can transition between thesupported states in response to particular conditions being detected bythe system, such as passage of a threshold amount of time withoutreceiving input form a user, battery charge falling below a thresholdlevel, receiving user input requesting a power state change, and so on.In the active state, the system (e.g., the memory sub-system and/or thehost system) is awake and in a full running state in which the memorysub-system accepts host commands and performs read or write operationsrequested by the host. When the system is in the active power state,calibration scans can be performed at particular “baseline” frequency,which can change over time. Calibration scans can also be performed inthe active power state at other times, such as in response to detectinga high read error rate. For example, the baseline frequency can be usedto determine when to initiate calibration scan iterations for bin 1, asdescribed above. The frequencies of the other bins can be based on thefrequency of bin 1 (e.g., logarithmically decreasing with bin age), sothe frequency of each bin, though different from other bins, can remainconstant during times in which the system is in the active power state.The baseline frequency can also change over time, e.g., in response tochanges in workload intensity or the number of program/erase cycles. Thebaseline scan frequency can be, for example, the time between scans oftwo page lines, or other suitable duration of time.

In the example of FIG. 13 , the memory sub-system enters (e.g.,transitions to) an idle state 1314 subsequent to the active state 1310.The transition to the idle state 1314 can occur in response to criteriasuch as user input, absence of application activity or user input, andso on. The idle state 1314, for example, can be a state in which hostcommands are not being received, though the system is still awake (e.g.,devices are still powered on). The idle state 1314 can correspond to theidle sate of a mobile device, or the PS0 state of an NVMe/PCIe device.Since the host is not requesting operations in the idle state, thememory sub-system can perform idle scan operations 1312 at a highfrequency, e.g., as high of a frequency as possible, by using as much ofits processing resources as possible to perform and complete acalibration scan instance (e.g., a scan of all bins or of all blockfamilies). There is little if any delay between initiation ofconsecutive scan iterations 1312. For example, the memory sub-system canperform each subsequent idle scan iteration 1312 immediately aftercompletion of the prior scan iteration 1312, without initiating otheroperations between subsequent idle scan iterations 1312. A calibrationscan instance can be completed by, for example, performing the scan foreach block family of each voltage bin. For example, the memorysub-system can perform the calibration scan until the calibration iscomplete (e.g., one calibration instance is complete) or until the powerstate changes to a state other than idle. As can be seen by the absenceof operations near the end of the idle state 1314 in FIG. 13 , anexample calibration scan instance has been completed prior to the end ofthe idle state 1314. During the idle state 1314 or in a subsequent idlestate (not shown), the memory sub-system can begin another calibrationscan instance after completion of a calibration scan instance, or aftera period of time has elapsed since completion of the previouscalibration scan instance.

Subsequent to the idle state 1314, the memory sub-system enters alow-power (hibernation) state 1318. The low power (hibernate) state canbe a state in which power consumption is reduced to a low level andprocessing is stopped for periods of time. System clocks can be sloweddown to reduce power consumption by the memory sub-system. Certaincomponents, such as the memory cells, can remain powered on. The lowpower (hibernate) state 1318 can have a self-awake feature in which thememory sub-system controller can request that operations be performed,e.g., at a specified frequency. In the low power (hibernate) state 1318,the memory sub-system can cause iterations of the calibration scan to beperformed at the specified frequency, and each iteration can scan aspecified number of pages. In the low power state 1318, the memorysub-system performs low power scan iterations 1316 in bursts that areinitiated at the specified frequency. Each burst can include one or moreof the scan iterations 1316. In the illustrated example, threeiterations are shown in each burst. The scanning can continue at thespecified frequency in the low power (hibernate) state until, forexample, a scan instance (e.g., a scan of all bins or of all blockfamilies) has been completed, or the system power state transitions to adifferent state. The low power (hibernate) state can correspond to thehibernate state of a mobile device, or the PS1-PS3 state of an NVMe/PCIedevice.

Subsequent to the idle state 1314, the memory sub-system begins atransition to a sleep state 1326. The sleep state can be a state inwhich the system is shut down, e.g., not performing operations.Calibrations scans are not ordinarily performed when the system is inthe sleep state, but can be performed for a short period of time (shownas a deferral state 1324), prior to entering the sleep state, using asleep deferment feature. The memory sub-system can receive a sleeprequest 1320 or other notification that the power state is to transitionto the sleep state 1326. The memory sub-system controller can thenrequest a deferral of the sleep state to perform calibration scanoperations in a deferral time period prior to entering the sleep state1326. The system enters the sleep state 1326 in response to expirationof the deferral time period. The deferral time period can be, forexample, 1 millisecond, or other suitable time. The deferral time periodis shown as the deferral state 1324. During the deferral time period,the memory sub-system can perform one or more deferral scan iterations1322. The deferral scan iterations can be performed on high-prioritybins or block families, e.g., the oldest bins, or block families thathave been identified as having high error rates. As shown in FIG. 13 ,the deferral scan iterations 1322 are complete prior to the end of thedeferral time period that corresponds to the deferral state 1324. Uponexpiration of the deferral time period, the system enters the sleepstate 1326. The sleep state can correspond to the deep sleep state of amobile device, or the PS4 state of an NVMe/PCIe device, for example.

FIG. 14A is a flow diagram of an example method 1400 to performcalibration scans in which the scan frequency varies based on powerstate, in accordance with aspects of the present disclosure. The method1400 can be performed by processing logic that can include hardware(e.g., processing device, circuitry, dedicated logic, programmablelogic, microcode, hardware of a device, integrated circuit, etc.),software (e.g., instructions run or executed on a processing device), ora combination thereof. In some embodiments, the method 1400 is performedby the block family manager component 113 of FIG. 1 . Although shown ina particular sequence or order, unless otherwise specified, the order ofthe processes can be modified. Thus, the illustrated embodiments shouldbe understood only as examples, and the illustrated processes can beperformed in a different order, and some processes can be performed inparallel. Additionally, one or more processes can be omitted in variousembodiments. Thus, not all processes are required in every embodiment.Other process flows are possible.

At operation 1402, the processing device initiates a block familycalibration scan of the memory device. The calibration scan comprises aplurality of scan iterations, each of which is initiated in accordancewith a scan frequency. Each scan iteration comprises the operations1404-1408. At operation 1404, the processing device detects a transitionassociated with the memory device from a first power state to a secondpower state. At operation 1406, the processing device, responsive todetecting the transition from the first power state to the second powerstate, determines an updated value of the scan frequency based on thesecond power state. One or more subsequent scan iterations are initiated(e.g., by one or more subsequent executions of operation 1402) inaccordance with the updated value of the scan frequency. At operation1408, the processing device performs one or more calibration operationsby updating at least one bin pointer of at least one block family basedon a data state metric of at least one block of the at least one blockfamily,

FIG. 14B is a flow diagram of an example method 1420 to performcalibration scans in which the scan frequency varies based on powerstate, in accordance with aspects of the present disclosure. The method1420 can be performed by processing logic that can include hardware(e.g., processing device, circuitry, dedicated logic, programmablelogic, microcode, hardware of a device, integrated circuit, etc.),software (e.g., instructions run or executed on a processing device), ora combination thereof. In some embodiments, the method 1420 is performedby the block family manager component 113 of FIG. 1 . Although shown ina particular sequence or order, unless otherwise specified, the order ofthe processes can be modified. Thus, the illustrated embodiments shouldbe understood only as examples, and the illustrated processes can beperformed in a different order, and some processes can be performed inparallel. Additionally, one or more processes can be omitted in variousembodiments. Thus, not all processes are required in every embodiment.Other process flows are possible.

At operation 1422, the processing device determines whether the systempower state is an idle state. If so, at operation 1424, the processingdevice increases the calibration scan frequency to perform as manycalibration operations as possible, and execution subsequently continuesat operation 1426. If operation 1422 determines that the power state isnot idle, then execution continues at operation 1426.

At operation 1426, the processing device determines whether the systempower state is a low power (e.g., hibernate) state. If so, at operation1428, the processing device schedules one or more wakeups to occur at aspecified frequency to perform calibration scan operations, andexecution subsequently continues at operation 1430. If operation 1426determines that the power state is not idle, then execution continues atoperation 1430.

At operation 1430, the processing device determines whether the host hasrequested that the system enter a sleep power state. For example, theprocessing device can receive a notification from the host indicatingthat the host is requesting a sleep power state. If the host hasrequested that the system enter the sleep power state, the processingdevice defers the entry into the sleep state and performs priority scanoperations during deferral period. Upon expiration of the deferralperiod, the host enters the sleep power state. Although particular powerstates and corresponding actions related to performing calibration scansare described herein, other power states are contemplated, and anysuitable actions related to performing calibration scans can beperformed by the memory sub-system.

FIG. 15 illustrates an example machine of a computer system 1500 withinwhich a set of instructions, for causing the machine to perform any oneor more of the methodologies discussed herein, can be executed. In someembodiments, the computer system 1500 can correspond to a host system(e.g., the host system 120 of FIG. 1 ) that includes, is coupled to, orutilizes a memory sub-system (e.g., the memory sub-system 150 of FIG. 1) or can be used to perform the operations of a controller (e.g., toexecute an operating system to perform operations corresponding to theblock family manager component 153 of FIG. 1 ). In alternativeembodiments, the machine can be connected (e.g., networked) to othermachines in a LAN, an intranet, an extranet, and/or the Internet. Themachine can operate in the capacity of a server or a client machine inclient-server network environment, as a peer machine in a peer-to-peer(or distributed) network environment, or as a server or a client machinein a cloud computing infrastructure or environment.

The machine can be a personal computer (PC), a tablet PC, a set-top box(STB), a Personal Digital Assistant (PDA), a cellular telephone, a webappliance, a server, a network router, a switch or bridge, or anymachine capable of executing a set of instructions (sequential orotherwise) that specify actions to be taken by that machine. Further,while a single machine is illustrated, the term “machine” shall also betaken to include any collection of machines that individually or jointlyexecute a set (or multiple sets) of instructions to perform any one ormore of the methodologies discussed herein.

The example computer system 1500 includes a processing device 1502, amain memory 1504 (e.g., read-only memory (ROM), flash memory, dynamicrandom access memory (DRAM) such as synchronous DRAM (SDRAM) or RDRAM,etc.), a static memory 1506 (e.g., flash memory, static random accessmemory (SRAM), etc.), and a data storage system 1518, which communicatewith each other via a bus 1530.

Processing device 1502 represents one or more general-purpose processingdevices such as a microprocessor, a central processing unit, or thelike. More particularly, the processing device can be a complexinstruction set computing (CISC) microprocessor, reduced instruction setcomputing (RISC) microprocessor, very long instruction word (VLIW)microprocessor, or a processor implementing other instruction sets, orprocessors implementing a combination of instruction sets. Processingdevice 1502 can also be one or more special-purpose processing devicessuch as an application specific integrated circuit (ASIC), a fieldprogrammable gate array (FPGA), a digital signal processor (DSP),network processor, or the like. The processing device 1502 is configuredto execute instructions 1526 for performing the operations and stepsdiscussed herein. The computer system 1500 can further include a networkinterface device 1508 to communicate over the network 1520.

The data storage system 1518 can include a machine-readable storagemedium 1524 (also known as a computer-readable medium) on which isstored one or more sets of instructions 1526 or software embodying anyone or more of the methodologies or functions described herein. Theinstructions 1526 can also reside, completely or at least partially,within the main memory 1504 and/or within the processing device 1502during execution thereof by the computer system 1500, the main memory1504 and the processing device 1502 also constituting machine-readablestorage media. The machine-readable storage medium 1524, data storagesystem 1518, and/or main memory 1504 can correspond to the memorysub-system 150 of FIG. 1 .

In one embodiment, the instructions 1526 include instructions toimplement functionality corresponding to a block family managercomponent (e.g., the block family manager component 153 of FIG. 1 ).While the machine-readable storage medium 1524 is shown in an exampleembodiment to be a single medium, the term “machine-readable storagemedium” should be taken to include a single medium or multiple mediathat store the one or more sets of instructions. The term“machine-readable storage medium” shall also be taken to include anymedium that is capable of storing or encoding a set of instructions forexecution by the machine and that cause the machine to perform any oneor more of the methodologies of the present disclosure. The term“machine-readable storage medium” shall accordingly be taken to include,but not be limited to, solid-state memories, optical media, and magneticmedia.

Some portions of the preceding detailed descriptions have been presentedin terms of algorithms and symbolic representations of operations ondata bits within a computer memory. These algorithmic descriptions andrepresentations are the ways used by those skilled in the dataprocessing arts to most effectively convey the substance of their workto others skilled in the art. An algorithm is here, and generally,conceived to be a self-consistent sequence of operations leading to adesired result. The operations are those requiring physicalmanipulations of physical quantities. Usually, though not necessarily,these quantities take the form of electrical or magnetic signals capableof being stored, combined, compared, and otherwise manipulated. It hasproven convenient at times, principally for reasons of common usage, torefer to these signals as bits, values, elements, symbols, characters,terms, numbers, or the like.

It should be borne in mind, however, that all of these and similar termsare to be associated with the appropriate physical quantities and aremerely convenient labels applied to these quantities. The presentdisclosure can refer to the action and processes of a computer system,or similar electronic computing device, that manipulates and transformsdata represented as physical (electronic) quantities within the computersystem's registers and memories into other data similarly represented asphysical quantities within the computer system memories or registers orother such information storage systems.

The present disclosure also relates to an apparatus for performing theoperations herein. This apparatus can be specially constructed for theintended purposes, or it can include a general purpose computerselectively activated or reconfigured by a computer program stored inthe computer. Such a computer program can be stored in a computerreadable storage medium, such as, but not limited to, any type of diskincluding floppy disks, optical disks, CD-ROMs, and magnetic-opticaldisks, read-only memories (ROMs), random access memories (RAMs), EPROMs,EEPROMs, magnetic or optical cards, or any type of media suitable forstoring electronic instructions, each coupled to a computer system bus.

The algorithms and displays presented herein are not inherently relatedto any particular computer or other apparatus. Various general purposesystems can be used with programs in accordance with the teachingsherein, or it can prove convenient to construct a more specializedapparatus to perform the method. The structure for a variety of thesesystems will appear as set forth in the description below. In addition,the present disclosure is not described with reference to any particularprogramming language. It will be appreciated that a variety ofprogramming languages can be used to implement the teachings of thedisclosure as described herein.

The present disclosure can be provided as a computer program product, orsoftware, that can include a machine-readable medium having storedthereon instructions, which can be used to program a computer system (orother electronic devices) to perform a process according to the presentdisclosure. A machine-readable medium includes any mechanism for storinginformation in a form readable by a machine (e.g., a computer). In someembodiments, a machine-readable (e.g., computer-readable) mediumincludes a machine (e.g., a computer) readable storage medium such as aread only memory (“ROM”), random access memory (“RAM”), magnetic diskstorage media, optical storage media, flash memory components, etc.

In the foregoing specification, embodiments of the disclosure have beendescribed with reference to specific example embodiments thereof. Itwill be evident that various modifications can be made thereto withoutdeparting from the broader spirit and scope of embodiments of thedisclosure as set forth in the following claims. The specification anddrawings are, accordingly, to be regarded in an illustrative senserather than a restrictive sense.

What is claimed is:
 1. A system comprising: a memory device; and aprocessing device, operatively coupled with the memory device, toperform operations comprising: determining a calibration scan frequencybased on an amount of elapsed time since a previous write operation wasperformed on the memory device; determining, based on the calibrationscan frequency, whether one or more scan criteria are satisfied;responsive to determining that the one or more scan criteria aresatisfied, identifying one or more block families; and scanning one ormore bin pointers of each of the identified one or more block families,wherein the scanning the one or more bin pointers comprises: for each ofthe identified one or more block families, updating each of the one ormore bin pointers of the identified block family based on a data statemetric of at least one block of the identified block family.
 2. Thesystem of claim 1, the operations further comprising: identifying atleast one second block family of the identified block families, the atleast one second block family having an error rate greater than athreshold error rate; and prioritizing the at least one second blockfamily in the scanning.
 3. The system of claim 2, wherein prioritizingthe at least one second block family causes one or more bin pointers ofthe at least one second block family to be scanned prior to scanning oneor more bin pointers of at least one other block family of theidentified block families.
 4. The system of claim 2, whereinprioritizing the at least one second block family comprises: adding theat least one second block family to a list of prioritized blockfamilies, wherein block families in the list are scanned prior to blockfamilies not in the list.
 5. The system of claim 1, wherein the one ormore scan criteria are based on a power state associated with the memorydevice, and wherein the one or more scan criteria are further based on adetermination that the memory device is in an idle power state.
 6. Thesystem of claim 1, wherein the one or more scan criteria are satisfiedin response to a host system associated with the memory devicetransitioning to an active power state from at least one of a sleepstate or low power state.
 7. The system of claim 1, wherein the one ormore scan criteria comprise a scan frequency, and wherein the scanfrequency is determined based on a write workload characterized by anumber of write requests received form a host system associated with thememory device.
 8. The system of claim 7, wherein the write workloadcomprises one or more of: a rate at which the write requests arereceived from the host system or an amount of data written by the writerequests received from the host system.
 9. The system of claim 1,wherein the one or more scan criteria comprise a scan frequency based ona number of program/erase cycles associated with the memory device. 10.The system of claim 1, wherein the one or more scan criteria comprise ascan frequency based on one or more of a read error rate of the memorydevice or a raw bit error rate (RBER) of the memory device.
 11. Thesystem of claim 10, wherein the operations further comprise: in responseto an increase of at least a threshold amount in the read error rate ofthe memory device, prioritizing, in the scanning, one or more blockfamilies associated with a block being currently scanned.
 12. The systemof claim 11, wherein prioritizing comprises scanning the one or moreblock families associated with the block currently being scanned priorto scanning one or more other block families.
 13. The system of claim 1,wherein each block family comprises a plurality of blocks of the memorydevice that have been programmed within a corresponding time window. 14.The system of claim 1, wherein the identifying the one or more blockfamilies comprises: identifying at least one first voltage bin, whereineach first voltage bin is associated with a plurality of read leveloffsets; and identifying, according to a block family creation order, anoldest block family from a plurality of associated with the firstvoltage bin, wherein the identified block families comprise the oldestblock family.
 15. The system of claim 14, wherein updating each of theone or more bin pointers comprises: determining one or more values ofthe data state metric based on at least one block of the oldest blockfamily; identifying a second voltage bin based on the one or more valuesof the data state metric; and associating the oldest block family withthe second voltage bin.
 16. A method comprising: determining, by acomputing device, a calibration scan frequency based on an amount ofelapsed time since a previous write operation was performed on a memorydevice; determining, based on the calibration scan frequency, whetherone or more scan criteria are satisfied; responsive to determining thatthe one or more scan criteria are satisfied, identifying one or moreblock families; and scanning one or more bin pointers of each of theidentified one or more block families, wherein the scanning the one ormore bin pointers comprises: for each of the identified one or moreblock families, updating each of the one or more bin pointers of theidentified block family based on a data state metric of at least oneblock of the identified block family.
 17. The method of claim 16,further comprising: identifying at least one second block family of theidentified block families, the at least one second block family havingan error rate greater than a threshold error rate; and prioritizing theat least one second block family in the scanning.
 18. The method ofclaim 17, wherein prioritizing the at least one second block familycauses one or more bin pointers of the at least one second block familyto be scanned prior to scanning one or more bin pointers of at least oneother block family of the identified block families.
 19. Anon-transitory machine-readable storage medium storing instructions thatcause a processing device to perform operations comprising: determininga calibration scan frequency based on an amount of elapsed time since aprevious write operation was performed on a memory device; determining,based on the calibration scan frequency, whether one or more scancriteria are satisfied; responsive to determining that the one or morescan criteria are satisfied, identifying one or more block families; andscanning one or more bin pointers of each of the identified one or moreblock families, wherein the scanning the one or more bin pointerscomprises: for each of the identified one or more block families,updating each of the one or more bin pointers of the identified blockfamily based on a data state metric of at least one block of theidentified block family.
 20. The non-transitory machine-readable storagemedium of claim 19, the operations further comprising: identifying atleast one second block family of the identified block families, the atleast one second block family having an error rate greater than athreshold error rate; and prioritizing the at least one second blockfamily in the scanning.